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Abstract 

The spatial interaction between two or more classes (or species) has important consequences in many fields 
and might cause multivariate clustering patterns such as segregation or association. The spatial pattern 
of segregation occurs when members of a class tend to be found near members of the same class (i.e., 
conspecifics) , while association occurs when members of a class tend to be found near members of the 
other class or classes. These patterns can be tested using a nearest neighbor contingency table (NNCT). 
The null hypothesis is randomness in the nearest neighbor (NN) structure, which may result from — 
among other patterns — random labeling (RL) or complete spatial randomness (CSR) of points from 
two or more classes (which is called CSR independence, henceforth). In this article, we consider Dixon's 
class-specific tests of segregation and introduce a new class-specific test, which is a new decomposition 
of Dixon's overall chi-square segregation statistic. We demonstrate that the tests we consider provide 
information on different aspects of the spatial interaction between the classes and are conditional under 
the CSR independence pattern, but not under the RL pattern. We analyze the distributional properties 
and prove the consistency of these tests; compare the empirical significant levels (Type I error rates) and 
empirical power estimates of the tests using extensive Monte Carlo simulations. We demonstrate that 
the new class-specific tests also have comparable performance with the currently available tests based on 
NNCTs in terms of Type I error and power. For illustrative purposes, we use three example data sets. 
We also provide guidelines for using these tests. 

Keywords: Association; clustering; completely mapped data; complete spatial randomness; random labeling; spatial 
point pattern 
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1 Introduction 



Spatial patterns have important implications in various fields, such as epidemiology, population biology, and ecology. 
A spatial point pattern includes the locations of some measurements, such as the coordinates of trees in a region 
of interest. These locations are referred to as events by some authors, in order to distinguish them from arbitrary 
points in the region of interest (iDiggld (|2003l )). However in this article such a distinction is not necessary, as we 
only consider the locations of events. Hence points will refer to the locations of events, henceforth. It is of p r actica l 
interest to inve s tigate the patterns o f one type of points with respect to other types. See, for example. IPielorj (|l96ll ), 
IWhipplj (|l980l ). [Pixonl (|l994l . [2002ah . In fact, most point patterns marked point patterns generated by marked point 
processes, which define the distributions of t he "marks" o r "class labels" to the l ocations of the points and pe rhaps 
are the most common spatial point patterns (|Digglel (|2003l ). [Cavrikov and StovanI (|l995l ), [Penttinen et al.l (|l992h . and 
ISchlather" et all (|2004l )). For convenience and generality, we call the different types of points as "classes", but the 
class can stand for any characteristic of an obs ervation at a particular location. For example, the spatia l segre gation 
pattern has been investigated for p lant species (jPiggld (|2003l )). age classes of plants jHamill and Wright! |l98^)), fish 
species (|Herler and Patznerl (|2005f )). and sexes of dioecious plant s (pSFanami et al.l (IT999I)). M any of the epidemiologic 



applications are for a two-class system of case and control labels l|Waller and Gotwavl (|2004l )') 



In the analysis of multivariate point patterns, the null pattern is usually one of the two (random) pattern types: 
random labeling (RL) or complete spatial randomness (CSR) of two or more classes (i.e., CSR independence). We 
consider two major types of spatial patterns as alternatives: association and segregation. Association occurs if the 
nearest neighbor (NN) of an individual is more likely to be from another class than to be from the same class. 
Segregation occurs if t he NN of an i ndividual is more likely to be of the same class as the individual than to be of 
another class (see, e.g.. iPieloul (|l96ll )). 



Many tests of spatial segr egation have b een proposed in the literature jKulldorfi (|2006|)). These include compariso n 



of Ripley's K or L- functions (|Riplevl (|2004l ')'). comparison of NN distances (|Digglel (|2003l '). ICuzick and Edwardd (|l990l ')') 



and analysis o f nearest neighbor contingency t ables (NNCT s) which are constructed using the NN frequencies of classes 
(|Pielorj (Il96ll). Meagher and Burdickl (|l980f )). IPieloul (|l96ll ) proposed tests (of segregation, symmetry, niche specificity, 



etc.) and lDixonI (Il994l) int roduce d an overall test of segregation, cell- and class-s pecific tests based on NNCTs for the 



two-class case and iDixonI (|2002ah extended his tests to multi-class case. ICevhan (2008bi) proposed new cell-specific 
and overall segregation tests which are more robust to the differences in the relative abundance of classes and have 
better performance in terms of size and power. The tests (i.e., inference) based on Ripley's K or L- functions are only 
appropriate when the null pattern can be assumed to be the CSR independen ce pattern , if the null pattern is the RL 
of points from an inhomogeneous Poisson pattern, they are no t appropriate (iKulldorfj (|2006l )). But, there are also 
variants of K{t) that explicitly correct for inhomogeneity (see iBaddelev et al'l (|2000l )). Cuzick and Edward's fc-NN 
tests are designed for testing bivariate spatial interaction and mostly used for s patial clusterin g of cases or controls in 
epidemiology. Biggie's D- function is a modified version of Ripley's Jf- function jPigglel (|2003l )) and is appropriate for 
the case in which the null pattern is the RL of points from any point pattern. However, Ripley's and Biggie's functions 
are designed to analyze univariate or bivariate spatial interaction at various scales (i.e., inter-point distances). On 
the other hand, Dixon's overall segregation test is a compound summary statistic and applicable to test (small-scale) 
multivariate spatial interaction between classes in a given study area; (base-)class-specific test provides information 
on the NN distribution of a base class (if {X, Y) is a pair of points in which Y is the closest point to X, then X 
is the base point and Y is the NN point), while NN-class-specific test (which is introduced in this article) provides 
the (base) distribution of classes to which members of a class serve as NNs. Hence, the NNCT-tests answer different 
questions; i.e., they provide information about different aspects of the spatial interaction (at small scales) compared 
to each other and the other tests in literature. 

IPieloul Hiil) used the usual Pearson's test of independence for testing spati al segregation. Due to the ea se in 
computation and interpretation, Pielou's test of segregation has been used frequently (|Meagher a nd Bur dick (Il980l)) for 



both complete l y map ped or sparsely sampled data, although it is not appropriate for such data (Meag her and Burdickl 
(|l980l ). [Dixonl ( 1994 )). For example, Piel ou's test is used for testing the segregation between males and females in dioe- 
cious s pecies (see, e.g.. lHerreral (|l988l ) aud i Armstrong and Irviiid (| 19891 )). and between different species (|Good and Whipple! 
|l98i)). 

In this article, we discuss Dixon's overall and class-specific test of segregation and introduce a new class-specific 
test. We only consider completely mapped data; i.e., for our data sets, the locations of all events in a defined space 
are observed. We provide the null and alternative patterns in Section (2] describe the NNCTs in Section [S] provide 
the base- and NN-class-specific tests in Sectional empirical significance levels for the two-class case in Section [SI for 
the three-class case in Section |6l empirical power comparisons for the segregation and association alternatives for 
the two-class case in Section [71 for the three-class case in Section [8] examples in Section [9] and our conclusions and 
guidelines for using the tests in Section [TPl 



2 Null and Alternative Patterns 



In this section, for simplicity, we describe the spatial point patterns for two classes only; the extension to multi-class 
case is straightforward. 

In the univariat e (i.e., one-cla ss) spatial point pattern analysis, the null hypothesis is usually complete spatial 
randomness (CSR) (|Digglel (|2003l 'l'). Given a spatial point pattern V = {Xi ■ I{Xi € D),i = l,...,n : D C R^} 
where Xi stands for the location of event i (i.e., point i) and l{Xi £ D) is the indicator function which denotes the 
Bernoulli random variable denoting the event that point i is in region D. The pattern V exhibits CSR if given n events 
in domain D, the events are an independent random sample from the uniform distribution on D. This implies that 
there is no spatial interaction; i.e., the locations of these points have no influence on one another. Furthermore, when 
the reference region D is large, the number of points in any planar region with area A{D) follows (approximately) a 
Poisson distribution with intensity A and mean A • A{D). 

To investigate the spatial interaction between two or more classes in a bivariate process, usually there are two 
benchmark hypotheses: (i) independence, which implies two classes of points are generated by a pair of independent 
univariate processes and (ii) random labeling (RL), which i mplies that the class labels are randomly assigned to a 
given set of locations in the region of interest l|Digglel (|2003l )). In complete spatial randomness (CSR) independence, 
points from each of the two classes satisfy the CSR pattern in the region of interest. On the other hand, RL is the 
pattern in which, given a fixed set of points in a region, class labels are assigned to these fixed points randomly so that 
the labels are independent of the locations. So, RL is less restrictive than CSR independence. CSR independence is 
a process defining the spatial distribution of event locations, while RL is a process, conditioned on locations, defining 
the distribution of labels on these locations. 

We consider either pattern as our null hypotheses in this article. That is, when the points are assumed to be 
uniformly distributed over the region of interest, then the null hypothesis we consider is 

Ho : CSR independence 

and when only the labeling (marking) of a set of fixed points, where the allocation of the points might be regular, 
aggregated, clustered, or of lattice type, is considered, our null hypothesis is 

Ho : RL. 

We discuss the differences in practice and theory for both cases. The distinction between CSR independence and RL 
is very important when defining the appropriate null m o del fo r each empirical case; i.e., the null model depends on 
the particular ecological context. ICoreaud and Pelissied (|2003l ) assert that CSR (independence) implies that the two 
classes are a priori the result of different processes (e.g., individuals of different species or age cohorts), whereas RL 
implies that some processes affect a posteriori the individuals of a single population (e.g., diseased vs. non-diseased 
individuals of a single species). Although CSR independence and RL are not same, they lead to the same null model 
(i.e., randomness in NN structure) in tests using NNCT, which does not require spatially-explicit information. We 
provide the differences in the proposed tests under either null hypotheses. 

As clustering alternatives, we consider two major types of spatial patterns: association and segregation. Association 
occurs if the NN of an individual is more likely to be from another class than to be of the same class as the individual. 
For example, in plant biology, the two classes of points might represent the coordinates of mutualistic plant species, 
so the species depend on each other to survive. As another example, one class of points might be the geometric 
coordinates of parasitic plants exploiting the other plant whose coordinates are of the other class. In epidemiology, 
one class of points might be the geographical coordinates of contaminant sources, such as a nuclear reactor, or a factory 
emitting toxic waste, and the other class of points might be the coordinates of the residences of cases (incidences) of 
certain diseases, e.g., some type of cancer caused by the contaminant. 

Segregation occurs if the NN of an individual is more likely to be of the same class as the indivi dual th a n to be 
from a different class; i.e., the members of the same class tend to be clumped or clustered (see, e.g., iPieloul (|l96ll )). 
For instance, one type of plant might not grow well around another type of plant, and vice versa. In plant biology, 
points from one class might represent the coordinates of trees from a species with large canopy, so that other plants 
(whose coordinat es are the poi nts fr om the other c l ass) t hat need light cannot grow (well or at all) around these trees. 
See, for instance. iDixonI (|l994l ') and lCoomes et al" I (|1999D . 

The two patterns of segregation or association are not symmetric in the sense that, when two classes are segregated 
(associated), they do not necessarily exhibit the same degree of segregation (association). Many different forms of 
segregation (and association) are possible. Although it is not possi ble to l i st all segregation types, its existence can be 
tested by an analysis of the NN relationships between the classes (|Pieloul (|l96ll )). 



3 Nearest Neighbor Contingency Tables 



NNCTs are constructed using the NN frequencies of classes. We describe the construction of NNCTs for two classes 
from a binomial spatial process; extension to multi-class case is straightforward. Consider two classes with labels 
{1, 2}. Let Ni be the number of points from class i for i £ {1, 2} and n be the total sample size, so n = A'^i + N2. 
If we record the class of each point and the class of its NN, the NN relationships fall into four distinct categories: 
(1, 1), (1,2); (2, 1), (2,2) where in cell {i,j), class i is the base class, while class j is the class of NN of class i. That 
is, the n points constitute n (base,NN) pairs. Then each pair can be categorized with respect to the base label (row 
categories) and NN label (column categories). Denoting Nij as the frequency of cell {i,j) for i,j € {1,2}, we obtain 
the NNCT in Table[T]where Cj is the sum of column j; i.e., number of times class j points serve as NNs for j £ {1, 2}. 
Furthermore, Nij is the cell count for cell {i,j) that is the sum of all (base,NN) pairs each of which has label 
Note also that n = j Nij; Ui = X^j^i Nij; and Cj = X^i^i ^ij- construction, if Nij is larger (smaller) than 
expected, then class j serves as NN more (less) to class i than expected, which implies (lack of) segregation if i = j 
and (lack of) association of class j with class i ii i ^ j. Furthermore, we adopt the convention that variables denoted 
by upper (lower) case letters are random (fixed) quantities. Hence, column sums and cell counts are random, while 
row sums and the overall sum are fixed quantities in a NNCT. However if we want to have the overall sum to be 
random also, we might consider a Poisson spatial process. 
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Table 1: NNCT for two classes. 



Note that, under segregation, the diagonal entries, Na for j = 1,2, tend to be larger than expected; while under 
association, the off-diagonals tend to be larger than expected. The general two-sided alternative is that some cell 
counts are different than expected under the null hypothesis (i.e., CSR independence or RL). 

iPieloul (|l96lf ) used test of independence to test for segregation. The assumption for the use of chi-square 
test for NNCTs is the independence between cell-cou nts (and rows and columns al so), which is violated under CSR 
independence or RL. This problem was first noted bv lMeagher and" Burdickl (|l980D who identify the main source of 
it to be reflexivity of (base, NN) pairs. A (base,NN) pair {X, Y) is reflexive if (Y,X) is a lso a (base,NN) pair. As 
an alternative, they suggest using Monte Carlo simulations for Pielou's test. iDixonI jl994h derived the appropriate 
asymptotic sampling distribution of cell counts using Moran join count statistics l|Moranl (| 19481 )) and hence the 
appropriate test which also has a x^-distribution asymptotically. 



4 Class-Specific Tests of Spatial Segregation 

In this section, we discuss the row- wise partitioning of the overall segregation test due to lDixoi] l|2002bl ') and introduce 
the column-wise partitioning. These class-specific tests can also be viewed as resulting from combining the cells at 
each row or column, respectively. 



4.1 Base-Class-Specific Test of Spatial Segregation 

Dixon' s overal l test of segregation tests the hypothesis that all cell counts in the NNCT are equal to their expected 
values (|Dbconl (|1994 )). Under RL, these expected cell counts are as 

E[iV,] = |"'^"'^')/("-^) ^^^=^'' (1) 

[71,71^/(71-1) lf*/j, 

where rii is a realization of Ni, i.e., is the fixed sample size for class i for i = 1, 2, . . . , g. Observe that the expected cell 
counts depend only o n the size of each class (i.e., row sums), but not on column sums. In the multi-class case with q 
classes, iDixonI (|2002al ) suggests the quadratic form 

C = (N-E[N])'S~(N-E[N]) (2) 

where N is the x 1 vector of q rows of cell counts concatenated row-wise, E[N] is the vector of E[A'^ii] which are as 
in Equation ((T}, E is the q^ x variance-covariance matrix for the cell count vector N with diagonal entries being 



equal to Var[A^ii] and off-diagonal en tries be i ng Co vfA^j, , Nki] for ^ {k,l). The explicit forms of the varianc e 

and covariance terms are provided in (|Dixonl ((20023))- Furthermore, E~ is a generalized inverse of E (ISearld (120061 )) 
and ' stands for the transpose of a vector or matrix. The overall test statistic C can be partitioned into q class-specific 
(or species-specific) test statistics. Consider the NNs of type i base points, i.e., row i in the NNCT denoted by 
Ni = {Nii,N,2,...,Ni,y. Then 

Ceii) = (N, - E[N,])'S-(N, - E[N,]) (3) 

where E[N,] is the vector of expected cell counts for row i and E~ is the generalized inverse of variance-covariance 
matrix of Ni. Asymptotically Cs(j) has xlq-i) distribution since Ei has rank {q — 1). Furthermore, Csii) are not 
independent, hence do not sum to C. Each Cs(j) tests whether each row i is similar to the one under RL; that is, the 
frequencies of the NNs for base-class i is as expected under RL: 

: E[N,] = {E[N,i], E[N,2], . . . , E[Af,,])' (4) 

where E[A^y] are as in Equation (ITJ. Therefore, we will call this type of class-specific test as base-class-specific test of 
segregation for class i, and hence the notation CB{i)- 

The tests in Equations ((2} and (|3]) depend on the quantities Q and R where Q is the number of points with 
shared NNs, which occurs when two or more points share a NN and R is twice the number of reflexive pairs. Then 
Q = 2 (Q2 -1- 3 Q3 -I- 6 Qi + 10 Qs -I- 15 Q%) where Qi is the number of points that serve as a NN to other points i times. 
Notice that under RL, Q and R axe fixed quantities, as they depend only on the location of the points, not the types 
of NNs. 



4.2 NN-Class-Specific Tests of Spatial Segregation 

Another way to partition C in Equation ^ is by columns instead of rows. Consider the frequency of type j points 
serving as NN to all other types (including class j), i.e., column j, namely, Cj = {N\j , A^2j , • . • , Nqj)' . 

Under RL, for each column j, we have 

CnnU) = (C, - E[C,])'E7HC, - E[C,]), (5) 

where E[Cj] is the vector of expected cell counts for column j and is the inverse of variance-covariance matrix of 
Cj. The test statistics CNN{j) are asymptotically distributed as X91 since Ej has rank q. The test statistics CnnU) 
axe dependent, hence do not sum to C. 

If significant, these class-specific tests imply that class j serves more (or less) frequently as AW to other classes 
(including class j itself) than expected under RL: 

Ho : E[C,] = {E[N^,],E[N2A, ■ ■ ■ ,E[iV,,])' (6) 

where E[A^ij] are as in Equation ([T}. Hence the notation Cjv]v(j). Moreover, we call them NN-class-specific tests of 
segregation which are also suggestive of deviation from RL, which might indicate segregation or association. 

Remark 4.1. The Status of Q and R under CSR Independence and RL: Under CSR independence, the expected 
cell counts in Equation ([T}, the overall test statistic in Equation ([2]), and the base-class-specific test in Equation ((3| 
and NN-class-specific test in Equation ([5]) and the relevant discussions are similar to the RL case. The only difference is 
that under RL the quantities Q and R axe fixed, while under CSR independence they are random. That is, under CSR 
independence, Cs(i) asymptotically has xfq-i) distribution and Cnn^j) asymptotically has Xq distribution conditional 
on Q and R, because the variances and covariances used in Equations ([3| and ((5]), and all the quantities depending on 
these expectations also depend on Q and R. Hence under the CSR independence model they are conditional variances 
and covariances obtained by conditioning on Q and R. The unconditional variances and covariances can be obtained 
by using the conditional expectation formula incorporating Q and R. Letting W — {Q, R)' , we have 

Var[N,j] = Eiv[Var[N,j\W]]+Yarw[E[N,j\W]] 
= Ew [Var [N,j\W]] 

since E[A'^ij|W^] is independent of Q and R. Similarly, 

Cov[N,j,Nki] = Ew[CovlN,j,Nki\W]] + Covw[E[{N,,,Nki)\W]] 
= Ew[CovlN^j,Nki\W]] 

sin ce EKNg , Nki )\W] is independent of Q and R. Furthermore, Var[A''ij and Co'v[Nij , Nki\W] are linear in Q and 
R (|Dixonl (|2002al )). Hence the unconditional variances and covariances can be found by replacing Q and R with their 
expectations. 



Unfortunately, given the difficulty of calculating the expectations of Q and R under CSR independence, it is 
reasonable and convenient to use test statistics emplo ying t he con ditional variances and covariances even when assessing 
their behavior under the CSR independence model. ICoxl (|l98ll ) calculated analytically that E[ii|A*'] = 0.6215A*' for a 
planar Poisson process. Alternatively, we can estimate the expected values of Q and R empirically, substitute these 
estimates in the expressions. For example, for homogeneous planar Poisson pattern, we have E[Q|A'^] « 0.63A'^ and 
'Ei[R\N] « 0.62Ai' (estimated empirically by 1000000 M onte Carlo s imulations for various values of A*' on unit square). 
Notice that E[i?|iV] agrees with the analytical result of lCoxl (|l98lh . □ 

Remark 4.2. Comparison of Base- and NN-Class Specific Tests: Notice that the base-class-specific test for 
class i tests whether the NN frequencies of base class i follows the expected under CSR independence (conditional on 
Q and R) or RL. That is, base-class-specific test for class (or species) i is concerned with how other classes interact 
as NNs with base class i in the NN structure. For example, if [q — 1) plant species are introduced to a region with 
species i being already there, the base-class-specific test for species i is more appropriate and informative as it provides 
information on how the newly introduced species interact with the already existent base species i. On the other hand, 
the NN-class-specific test for class j tests whether the base frequencies for which class j serves as NN follows the 
expected under CSR independence (conditional on Q and R) or RL. So, NN-class-specific test for class j is concerned 
with how class j points interact as NNs with other base classes in the NN structure. For example, if species j was 
introduced to a region with (g — 1) species already present, NN-class-specific test for species j is more appropriate and 
informative, as it provides information on how the newly introduced species interact with the already existent (g — 1) 
base species. □ 



4.3 The Two-Class Case 



In the two-class case. iDixonI l|l994 ) calculates Zu — [Na — E[Nii]) / ^Va.r[Nii] for both i £ {1, 2} and then combines 
these test statistics into a statistic that is asymptotically distributed as X2- The suggested test statistic is given by 



C = Y'E^^Y = 



TVn -E[An] 

N22 - E[N22] 



Notice that this is equivalent to C = 



Zaa + Zbb 
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where Zaa = 



All - E[Aii] 
^Var[Aii] 



A22 - E[A2: 



x/Var[A2: 



and r — Cov[Aii , A22]/\/Var[Aii] Var[A22]. The overall test statistic C can be partitioned into two base-class- 
specific test statistics CB{i) which asymptotically have xu the chi-square distribution with 1 degree of freedom. 

When C is partitioned by columns Cj, we get the NN-class-specific test CnnU) which are asymptotically dis- 

Cjvjv(l) ~ Cnn{'2) = C, since each column Cj contains all the 



tributed as xi.- However, in the two-class case, 
information about the NNCT. 



Remark 4.3. The Multi-Class Case with q > 2 Classes: In the multi-class case with more than 2 classes, the 
overall test statistic C can be partitioned into q base-class-specific test statistics Cnii) which asymptotically have Xq-i- 
When C is partitioned by columns Cj, we get q NN-class-specific tests Cjvjv(j) which are asymptotically distributed 
cis Xq- In the case of g > 2, CnnU) ^^'^ Cnn{1) are very likely to be different for j 7^ I and CnnU) ^-^^d C are also 
very likely to be different for each j. □ 



4.4 Asymptotic Structures for the NNCT-Tests 

There are two major types of asymptotic structures for spatial data (|Lahiril (|l996l )'). In the first, any two observations 
are required to be at least a fixed distance apart, hence as the number of observations increase, the region on which 
the process is observed eventually becomes unbounded. This type of sampling structure is called increasing domain 
asymptotics. In the second type, the region of interest is a fixed bounded region and more and more points are observed 
in this region. Hence the minimum distance between d ata poin t s ten ds to zero as the sample size tends to infinity. 
This type of structure is called infill asymptotics, due to ICressi 3 (|1993|). 

Under RL, the sampling structure in our asymptotic sampling distribution could be either one of increasing domain 
or infill asymptotics, as we only consider the class sizes and hence the total sample size tending to infinity regardless 
of the size of the study region. That is, under RL, the locations on which labels are assigned randomly could be 
realizations from a process whose asymptotic structure follows either infill or increasing domain asymptotics. On the 
other hand, under CSR independence, we can not have the increasing domain asymptotics in the usual sense, since 
for any bounded subset of the region, the points will not necessarily follow the uniform distribution as there is a 
restriction on the interpoint distances. But if we let the area of the region go to infinity while uniformness is preserved 
for any bounded subset of the region, this type of "modified" increasing domain asymptotics is applicable for the CSR 
independence pattern. Furthermore, under CSR independence, the number of points from uniform distribution on a 



bounded region going to infinity will give rise to the infill asymptotics. However, the values and distributions of Q and 
R axe affected by infill and increasing domain asymptotics. In both asymptotic structures, under CSR independence, 
the tests will still be conditional on Q and R, whose asymptotic distribution will also depend on the type of the 
asymptotics. For the actual conditional result, the only possible asymptotics for which the current arguments might 
be appropriate is what might be called pattern stamp asymptotics. In this form of asymptotics, the study region 
(assumed to be rectangular or at least something that will tile the plane) and its locations are repeated (stamped). 
This multiple stamping of the same pattern of locations maintains Q and R at fixed constants. However, this is not 
either of the usual asymptotics for the spatial data nor a realistic case in practice. 

Theorem 4.4. (Consistency) Under RL, the overall test in Equation Q which rejects for C > Xq(q-i){^ ~ Qf) where 
(1 ~ Q^) 100(1 — a)"* percentile of x^^g-i) distribution, the test against Ho in Equation Q which rejects for 

Csii) > X(q-i)(l ~ Q;)) 'm^'^ t^^t against Ho in Equation which rejects for CnnU) > Xq(l ~ "-^^ consistent. 
Under CSR independence, consistency follows conditional on Q and R. 

Proof: The consistency of the overall test is proved in (|Cevhanl (|2008ah '). Under RL, the null hypothesis in 
Equation Q is equivalent to Ho : A = where A is the non-centrality parameter for the xfq-i) distribution. Any 
deviation from the null case corresponds to Ha : A > 0, then under any specific alternative the power tends to unity 
by the consistency of usual tests. The consistency of Cjvjv(j) under RL and of both tests under CSR independence 
follow similarly. ■ 



5 Empirical Significance Levels for the Two-Class Case 

In this section, we provide the empirical significance levels for base- and NN-class-specific tests and Dixon's overall 
test of segregation for the two-class case under CSR independence and RL. 

5.1 Empirical Significance Levels for the Two-Class Case under CSR Indepen- 
dence 

First, we consider the two-class case with classes X and Y. We generate ni points from class X and n2 points from 
class Y both of which are independently uniformly distributed on the unit square, (0, 1) x (0, 1). Hence, all X points are 
independent of each other and so are Y points; and X and Y axe independent data sets. This will imply randomness 
in the NN structure, which is the null hypothesis for our NNCT-tests. Thus, we simulate the CSR independence 
pattern for the performance of the tests under the null case. We generate X and Y points for some combinations of 
'^l,f^2 £ {10,30,50, 100}. We repeat the sample generation Nmc ~ 10000 times for each sample size combination for 
a reasonable precision in the results within a reasonable time period. For each Monte Carlo replication, we construct 
the NNCT for classes X and Y, then compute the base- and NN-class-specific tests for each class and Dixon's overall 
segregation test. Out of these 10000 samples the number of significant results by each test is recorded. The nominal 
significance level used in all these tests is a = 0.05. Then empirical sizes are calculated as, e.g., Sf = mi/ Nmc where 
mi is the number of significant base-class-specific test statistics for class i. The empirical sizes for NN-class-specific 
tests and Dixon's overall segregation test are calculated similarly. 

We present the empirical significance levels for the NNCT-tests in Tabled where 3f is the empirical significance 
level for (Dixon's) base-class-specific test and Sf^ is for the NN-class-specific test for i G {1, 2} and od is for Dixon's 
overall test of segregation. The empirical sizes significantly smaller (larger) than 0.05 are marked with " (*), which 
indicate the corresponding test is conservative (liberal). The asymptotic normal approximation to proportions are 
used in determining the significance of the deviations of the empirical sizes from the nominal level of 0.05. For the 
tests regarding the proportions, we also use a — 0.05 to test against empirical size being equal to 0.05. Notice that in 
the two-class case Sf 7^ Sf but Sf ^ — Sf ^ — old, so only Sf ^ is presented. 

Observe that (Dixon's) base-class-specific test for class Y (i.e., the larger class in unequal sample size combinations) 
yields about the desired significance levels in rejecting Ho ■ CSR independence for all sample size combinations. 
However, when one of the samples is small the tests are conservative, but base-class-specific test for class X (i.e., the 
class with the smaller sample size) is more conservative than others. For large classes with unequal sizes (i.e., when 
the relative abundance of classes is different), the base-class-specific test tends to be slightly liberal for the smaller 
class. 



Empirical significance levels 
under CSR independence 



sizes 


base 


NN* 


{ni,n2) 


Sf 






(10,10) 


.0454^ 


.0465 


.0432= 


(10,30) 


.0306= 


.0485 


.0440= 


(10,50) 


.0270= 


.0464 


.0482 


(30,30) 


.0507 


.0505 


.0464 


(30,50) 


.0590* 


.0522 


.0443= 


(50,50) 


.0465 


.0469 


.0508 


(50,100) 


.0601^ 


.0533 


.0560* 


(100,100) 


.0493 


.0463= 


.0504 



Table 2: The empirical significance levels for the NNCT-tests in the two-class case under Ho : 
CSR independence with Nmc ~ 10000, ni,n2 in {10,30,50,100} at the nominal level of a = 0.05. (=: 
empirical size significantly less than 0.05; i.e., the test is conservative. ^: empirical size significantly larger 
than 0.05; i.e., the test is liberal. *only is presented since a^'^ = 012^ = an- base = base-class-spccific 
test, NN = NN-class-spccific test.) 

5.2 Empirical Significance Levels for the Two-Class Case under RL 

Recall that the segregation tests we consider are conditional under the CSR independence pattern. To evaluate their 
empirical size performance better, we also perform Monte Carlo simulations under the RL pattern, for which the tests 
are not conditional. For the RL pattern we consider the following three cases, in each of which, we first determine the 
locations of points for which labels are to be assigned randomly. Then we apply the RL procedure to these points for 
the appropriate sample size combinations. 

RL Case (1): First, we generate n — (ni + n2) points iid U{{0, 1) x (0, 1)), the uniform distribution on the unit 
square, for some combinations of n\,n2 € {10,30,50, 100}. The locations of these points are taken to be the fixed 
locations for which we assign the labels randomly. Thus, we simulate the RL pattern for the performance of the tests 
under the null case. For each sample size combination (711,712), we randomly choose n\ points (without replacement) 
and label them as X and the remaining n2 points as Y points. We repeat the RL procedure Nmc = 10000 times for 
each sample size combination. For each RL iteration, we construct the NNCT for classes X and Y , and then compute 
the base- and NN-class-specific tests for each class and Dixon's overall segregation test. Out of these 10000 samples the 
number of significant tests by each test is recorded. The nominal significance level used in all these tests is a = 0.05. 
Based on these significant results, empirical sizes are calculated as the ratio of number of significant test statistics to 
the number of Monte Carlo replications, Nmc- 

RL Case (2): We generate m points iid W((0, 2/3) x (0,2/3)) and n-z points iid W((l/3, 1) x (1/3,1)) for some 
combinations of 711, 712 £ {10, 30, 50, 100}. The locations of these points are taken to be the fixed locations for which 
we assign labels randomly. The RL process is applied to these fixed points Nmc ~ 10000 times for each sample size 
combination. The empirical sizes for the tests are calculated similarly as in RL case (1). 

RL Case (3): We generate 711 points iid W((0, 1) x (0,1)) 712 points iid W((2,3) x (0,1)) for some combinations of 
711, 712 £ {10, 30, 50, 100}. RL procedure and the empirical sizes for the tests are calculated similarly as in the previous 
RL cases. 

The locations for which the RL procedure is applied in RL Cases (l)-(3) are plotted in Figure[T]for 711=712 = 100. 
We present the empirical significance levels for the NNCT-tests in Table O where the empirical significance level 
labeling is as in Tabled The empirical sizes are marked with and ^ for conservativeness and liberalness as in Section 
15.11 Observe that when both samples are small, base-class-specific tests are liberal under RL Case (2), conservative 
under other RL cases, and all other tests (NN-class-specific tests and overall segregation test) are conservative under 
all RL cases. When one sample is small, the tests tend to be conservative, especially for the base-class-specific test for 
the smaller class. When both samples are large, the base-class-specific test is liberal for the smaller class, and other 
tests are about the desired level. 

Comparing Tables [2] and |3] we observe that the empirical sizes are about the same under the CSR independence 
and RL patterns. However, the tests are conservative when at least one sample is small, regardless of whether the null 
case is CSR independence or RL. 



RL Case (1 ) RL Case (2) RL Case (3) 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 O.S 0.8 1.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 

X coordinate x coordinate x coordinate 



Figure 1: The fixed locations for which RL procedure is applied for RL Cases (l)-(3) with ni = n2 = 100 in 
the two-class case. Notice that x-axis for RL Case (3) is differently scaled. 



Empirical significance levels under RL 




RL Case (1) 


RL Case (2) 


RL Case (3) 


sizes 


base 


NN* 


base 


NN* 


base 


NN* 


{ni,n2) 


Sf 




Sf^ 






S^^ 






Sf^ 


(10,10) 


.0356= 


.0341= 


.0491 


.0624* 


.0657* 


.0446= 


.0444= 


.0481* 


.0404= 


(10,30) 


.0311= 


.0699* 


.0466 


.0297= 


.0341= 


.0327= 


.0281= 


.0447= 


.0324= 


(10,50) 


.0264= 


.0472 


.0507 


.0251= 


.0384= 


.0508 


.0260= 


.0404= 


.0500 


(30,30) 


.0579 


.0547 


.0497 


.0513 


.0523 


.0469 


.0549* 


.0553* 


.0484 


(30,50) 


.0621* 


.0608* 


.0444= 


.0626^ 


.0594* 


.0411= 


.0677* 


.0685* 


.0445= 


(50,50) 


.0512 


.0524 


.0497 


.0509 


.0511 


.0501 


.0504 


.0506 


.0488 


(50,100) 


.0625* 


.0512 


.0482 


.0566* 


.0421= 


.0460 


.0590* 


.0484 


.0479 


(100,100) 


.0538*^ 


.0534 


.0525 


.0439= 


.0453= 


.0505 


.0495 


.0476 


.0534 



Table 3: The empirical significance levels for the two-class case under Ho : RL for RL Cases (l)-(3) with 
Nriic = 10000, ni, n2 in {10, 30, 50, 100} at the nominal level of a = 0.05. (=: empirical size significantly less 
than 0.05; i.e., the test is conservative. *: empirical size significantly larger than 0.05; i.e., the test is liberal. 
*only cii^ is presented since a^'^ — — an- base — base-class-specific test, NN = NN-class-specific 
test.) 

6 Empirical Significance Levels for the Three-Class Case 

In this section, we provide the empirical significance levels for the base- and NN-class-specific tests and Dixon's overall 
test of segregation for the three-class case under CSR independence and RL. 

6.1 Empirical Significance Levels for the Three-Class Case under CSR Indepen- 
dence 

In the two-class case, NN-class-specific tests are equal to the overall test of segregation. But for the g-class case 
with q > 2, neither CnnH) are likely to be equal to each other nor any of them are likely to be equal to C for 
j = 1,2, Therefore, in order to better compare the performance of NN-class-specific tests with base-class- 

specific tests, we consider the three-class case with classes X, Y, and Z under CSR independence. We generate 
ni, 712, 7i3 points distributed independently uniformly on the unit square (0,1) x (0,1) from classes X, Y, and Z, 
respectively. That is, each data set from classes X, Y, and Z enjoy within sample and between sample independence. 
We generate data points for some combinations of ni,n2,n3 £ {10, 30, 50, 100}; and for each sample size combination, 
we generate data sets X, Y, and Z for Nmc = 10000 times. The empirical sizes and the significance of their deviation 
from 0.05 are calculated as in Section [5. II 

The empirical significance levels for the three-class case are presented in Table U where empirical sizes significantly 
smaller (larger) than 0.05 are marked with (*), which indicate the corresponding test is conservative (liberal). Notice 
that when at least one class is small (i.e., rii < 10) tests are usually conservative, especially the base-class-specific 



tests for the smaller classes. When all classes are large, the tests are about the desired level, and conservative for 
a few sample size combinations. For the three-class case, we conclude that the NN-class-specific tests exhibit better 
performance than the base-class-specific tests in terms of empirical size. 



Empirical significance levels under CSR independence 



sizes 


base 


NN 


overall 


(ni, 7i2, 713) 


"i 


"2 






"2 






/'in in 1 

^iU;iU,iU ) 


n/ini c 

.U4U1 


n/i 'io'^ 

.U4oz 


CiA "IRC 




.U4:ZZ 


n/inn^ 

.U4UU 


.U4zl 


/I n 1 n "itW 


.0416^= 


.0422^= 


.0493 


.0448^= 


.0436^= 


.0450^= 


0/1/1 


(10,10,50) 


.0356^ 


.0350^= 


.0449^= 


.0489 


.0495 


.0463^= 


.0510 


(10,30,30) 


.0369'= 


.0522 


.0514 


.0487 


.0433^= 


.0503 


.0439^= 


(10,30,50) 


.0525 


.0432^= 


.0446^= 


.0473 


.0434^= 


.0444^= 


.0450^= 


(10,50,50) 


.0582*^ 


.0492 


.0487 


.0422^= 


.0519 


.0528 


.0566* 


(30,30,30) 


.0487 


.0426^= 


.0466 


.0511 


.0492 


.0498 


.0497 


(30,30,50) 


.0488 


.0476 


.0485 


.0457^= 


.0498 


.0479 


.0463^ 


(30,50,50) 


.0467 


.0508 


.0504 


.0461^= 


.0492 


.0520 


.0486 


(50,50,50) 


.0504 


.0509 


.0493 


.0525 


.0496 


.0483 


.0497 


(50,50,100) 


.0513 


.0481 


.0499 


.0479 


.0503 


.0485 


.0488 


(50,100,100) 


.0455^ 


.0496 


.0491 


.0489 


.0463 


.0487 


.0495 


(100,100,100) 


.0502 


.0515 


.0496 


.0484 


.0482 


.0482 


.0456^= 



Table 4: The empirical significance levels for the three-class case under Ho : CSR independence with 
Nmc = 10000, ni,n2,n3 in {10,30,50,100} at the nominal level of a = 0.05. ("^i the empirical size is 
significantly smaller than 0.05; i.e., the test is conservative. *: the empirical size is significantly larger than 
0.05; i.e., the test is liberal, base = base-class-specific test, NN = NN-class-specific test, overall = overall 
test.) 



6.2 Empirical Significance Levels for the Three-Class Case under RL 

To remove the confounding effect of conditional nature of the tests under CSR independence, we also perform Monte 
Carlo simulations under the RL pattern. For RL with q > 2 classes, we consider two cases. In each case, first we 
determine the locations of points for which labels are to be assigned randomly. Then we apply the RL process to these 
points for various appropriate sample size combinations. 

RL Case (1): First, we generate n = {ni +n2 + 713) points iid W((0, 1) x (0, 1)) for some combinations of ni, 712, ns G 
{10,30,50,100}. The locations of these points are taken to be the fixed locations for which we assign the labels 
randomly. Thus, we simulate the RL pattern for the performance of the tests under the null case. For each sample 
size combination (ni, 712,713) we pick ni points (without replacement) and label them as X, pick 712 points from the 
remaining points (without replacement) and label them as Y points, and label the remaining 713 points as Z points. 
We repeat the RL procedure Nmc ~ 10000 times for each sample size combination. For each RL iteration, we construct 
the NNCT for classes X, Y , and Z and then compute the test statistics. Out of these 10000 samples the number of 
significant results by the tests is recorded. The nominal significance level used in all these tests is a = 0.05. Based on 
these significant results, empirical sizes are calculated as before. 

RL Case (2): We generate rii points iid U{(0, 1) x (0, 1)), 712 points iid W((2, 3) x (0, 1)), and 713 points iid W((l, 2) x 
(2,3)) for some combinations of 711,712,713 £ {10,30,50,100}. RL procedure is performed and the empirical sizes for 
the tests are calculated similarly as in RL Case (1). 

The locations for which the RL procedure is applied in RL Cases (l)-(2) are plotted in Figure [2] for 711 = n2 = 
713 — 100. We present the empirical significance levels for the NNCT-tests in Table O where the empirical significance 
level labeling is as in Table |4] The empirical sizes are marked with and ^ for conservativeness and liberalness as in 
Section [5. II Observe that when all sample sizes are equal each test statistic is about the desired significance level in 
rejecting Ho : RL for both RL Cases. When at least one class has a small sample size (ti^ < 10), the tests are liberal, 
conservative, or at the desired level. In particular, the base-class-specific test tends to be conservative for class X, and 
NN-class-specific test tends to be conservative for class Y. 

Comparing Tables |3] and [S] we observe that the empirical sizes are about the same under the CSR independence 
and RL patterns. Hence we conclude that the conservativeness or liberalness of the tests for small sample sizes does 
not result from conditioning on Q and R under the CSR independence pattern, as we get similar results under the RL 
pattern also. 



RLCase(1) RLCase(2) 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 

X coordinate x coordinate 



Figure 2: The fixed locations for which RL procedure is apphcd for RL Cases (l)-(2) with ni — n2 = ~ 100 
in the two-class case. Notice that x-axis for RL Case (2) is differently scaled. 

Remark 6.1. Main Result of Monte Carlo Simulations for Empirical Sizes: Based on the simulation results 
presented in Sections [S] and [§1 we conclude that none of the tests we consider have the desired level when at least 
one sample size is small so that the cell count(s) in the corresponding NNCT have a high probability of being < 5. 
This corresponds to the case that at least one sample size is < 10 in our simulation study. However, when relative 
abundances (i.e., sample sizes) are very different, cell counts are also more likely to be < 5, compared to cell counts 
for similar sample sizes (roughly, we assume sample sizes are similar when maxi(ni)/ mini(ni) < 2.) The tests tend 
to be conservative when the row or column they pertain to contain cell(s) whose count(s) are < 5. For larger samples 
which i mply the cell counts are larger than 5, the tests yield empirical sizes that are about the desired nominal level. 
iDixonI (|1994| ') recommends Monte Carlo randomization test when some cell count (s) are < 5 in a NNCT for the 
overall segregation test. We extend this recommendation for the base- and NN-class-specific tests, in the sense that, 
if the corresponding row or column contains cells whose counts are < 5, we recommend Monte Carlo randomization, 
otherwise we recommend the asymptotic approximation for the class-specific tests. Thus, when sample sizes are 
small (hence the corresponding cell counts are < 5), the asymptotic approximation of the tests is not appropriate, in 
particular for the base-class-specific test for the smaller class. So when at least one sample size is small, the power 
comparisons should be carried out using the Monte Carlo critical values. On the other hand, for large samples, the 
power comparisons can be made using the asymptotic or Monte Carlo critical values. □ 

Remark 6.2. Monte Carlo Critical Values: When sample sizes are small so that some cell count(s) are expected 
to be < 5 with a high probability, then it will not be appropriate to use the asymptotic approximation (hence the 
asymptotic critical values) for the overall and class-specific tests of segregation (see Remark 16. If) . That is, when some 
cell counts are small, the tests tend to be either liberal or conservative. In order to better evaluate the empirical 
power performance of the tests, for each sample size combination, we record the test statistics at each Monte Carlo 
simulation under the CSR independence cases of Sections 15.11 and 16.11 We find the 95"* percentiles of the recorded 
test statistics at each sample size combination (not presented) and use them as "Monte Carlo critical values" for the 
power estimation in the following sections. For example, for base-class-specific test for class X in the two-class case, 
the lest statistics are recorded for (721,712) — (30, 50) under the CSR independence pattern as in Section [5. II then 
the 95"' percentile of these statistics is used as the Monte Carlo critical value for (ni, n2) = (30, 50). That is, under a 
segregation or association alternative with (tii, 112) = (30, 50), a calculated test statistic is deemed significant if greater 
than or equal this Monte Carlo critical value. □ 



7 Empirical Power Analysis for the Two- Class Case 



We consider three cases for each of segregation and association alternatives in the two-class case. 



Empirical significance levels under RL Case (1) 



sizes 


base 


NN 
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(ni, 712,713) 














OlD 


(10,10,10) 


.0453'= 


miv 


.0374^= 


.0360^= 


.040r 


.0393^= 


miT 


(10,10,30) 


.0298'= 


.0319^= 


.0483 


.0474 


.0477 


.0402^= 


.0455^= 


(10,10,50) 


.0346^ 


.0304^= 


.0478 


.0497 


.0459^= 


.0436^= 


.0477 


(10,30,30) 


.0466 


.0528 


.0522 


.0497 


.0444^= 


.0476 


.0477 


(10,30,50) 


.0563*^ 


.0491 


.0467 


.0551* 


.0470 


.0477 


.0518 


(10,50,50) 


.0403^ 


.0471 


.0481 


.0554* 


.0460^= 


.0445^= 


.0500 


(30,30,30) 


.0447^= 


.0465 


.043r 


.0506 


.0473 


.0479 


.0471 


(30,30,50) 


.0479 


.0491 


.0529 


.0485 


.0508 


.0519 


.0489 


(30,50,50) 


.0503 


.0507 


.0526 


.0509 


.0472 


.0501 


.0473 


(50,50,50) 


.0482 


.0533 


.0528 


.0482 


.0506 


.0481 


.0469 


(50,50,100) 


.0488 


.0481 


.0494 


.0481 


.0468 


.0462^= 


.0450^= 


(50,100,100) 


.0473 


.0486 


.0465 


.0472^= 


.0496 


.0459^= 


.0476 


(100,100,100) 


.0516 


.0525 


.0518 


.0485 


.0508 


.0510 


.0510 


Empirical significance levels under RL Case (2) 


(10,10,10) 


.0371^ 


.0384^= 


.0397^= 


.0358^= 


.0339== 


.0355^= 


.0372^= 


(10,10,30) 


.0553* 


.0550* 


.0457^= 


.0457^= 


.0447^= 


.0426^= 


.0470 


(10,10,50) 


.0368^ 


.0365^= 


.0458^= 


.0497 


.0523 


.0477 


.0501 


(10,30,30) 


.0433^ 


.0498 


.0485 


.0485 


.0463^= 


.0435^= 


.0420^= 


(10,30,50) 


.0503 


Mil" 


.0473 


.0513 


.046^ 


.043r 


.0469 


(10,50,50) 


.0401^ 


.0493 


.0476 


.0510 


.0483 


.0489 


.0507 


(30,30,30) 


.0484 


.0452^= 


.0493 


.0489 


.0464 


.0504 


.0472 


(30,30,50) 


.0429^ 


.0462^= 


.0454^= 


.0462^ 


.0433^= 


.0485 


.0436^= 


(30,50,50) 


.0488 


.0502 


.0510 


.0482 


.0479 


.0476 


.0475 


(50,50,50) 


.0539*^ 


.0511 


.0525 


.0468 


.0501 


.0531 


.0513 


(50,50,100) 


.0518 


.0506 


.0507 


.0490 


.0498 


.0506 


.0497 


(50,100,100) 


.0504 


.0500 


.0483 


.0483 


.0511 


.0488 


.0516 


(100,100,100) 


.0542*^ 


.0506 


.0477 


.0463 


.0506 


.0479 


.0457^= 



Table 5: The empirical significance levels for the three-class case under Ho : RL for RL Cases (l)-(2) with 
Nmc = 10000, 7ii,7i2,7i3 in {10,30,50,100} at the nominal level of a = 0.05. {'^: the empirical size is 
significantly smaller than 0.05; i.e., the test is conservative. ^: the empirical size is significantly larger than 
0.05; i.e., the test is liberal, base = base-class-specific test, NN = NN-class-specific test, overall = overall 
test.) 

7.1 Empirical Power Analysis under Segregation Alternatives in the Two-Class 
Case 

For the segregation alternatives, we generate Xi W((0, 1 — s) x (0, 1 — s)) and Yj *~ W((s, 1) x (s, 1)) for i = 1, . . . , ni 
and j = 1,. . . ,712. Notice that the level of segregation is determined by the magnitude of s £ (0, 1). We consider the 
following three segregation alternatives: 

H's:s = 1/G, H's' -.3 = 1/4:, a.nd H's" -.s^ 1/3. (8) 

Observe that, from to Hg^' (i.e., as s increases), the segregation between X and Y gets stronger in the 
sense that X and Y points tend to form one-class clumps or clusters. We calculate the power estimates using the 
asymptotic critical values based on the corresponding x^-distributions and using the Monte Carlo critical values. The 
power estimates based on the asymptotic and Monte Carlo critical values are presented in Table |6] where f3f are for 
(Dixon's) base-class-specific tests for classes i = 1,2, and (3i'^ is for NN-class-specific test for class 1, (3^ is for Dixon's 
overall segregation test. We only present (3^^ since f3^^ = [32^ = . Observe that, for both class-specific tests, 
as n = (jii -|- 712) gets larger, the power estimates get larger; for similar ti = (711 + 712) values, the power estimate is 
larger for classes with similar relative abundance (i.e., for similar sample sizes ni 712); and as the segregation gets 
stronger, the power estimates get larger for each sample size combination. Furthermore, power estimates based on the 
Monte Carlo critical values tend to be larger compared to the ones using the asymptotic critical values. When sample 
sizes are similar, NN-class-specific tests tend to have higher power. On the other hand, when relative abundances (i.e., 



sample sizes) are very different, the base-class-specific test for classes X and Y have the highest and lowest power 
estimates, respectively. 



Empirical power estimates under the segregation alternatives 
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(10,10) 
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.4045 


.3911 
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.4096 


(50,50) 


.5091 


.5016 


.5546 


.5347 


.5237 


.5541 


(50, 100) 


.7686 


.6689 


.7425 


.7540 


.6629 


.7307 


(100, 100) 


.8761 


.8730 


.9121 


.8778 


.8891 


.9119 


H¥ 


(10,10) 


.2057 


.2044 


.2305 


.2439 


.2419 


.2364 


(10,30) 


.4601 


.4133 


.4555 


.5365 


.4181 


.4745 


(10,50) 


.5420 


.4477 


.6174 


.6368 


.4627 


.6187 


(30, 30) 


.7783 


.7769 


.8141 


.7783 


.7769 


.8207 


(30,50) 


.9262 


.8775 


.9126 


.9095 


.8712 


.9199 


(50,50) 


.9543 


.9551 


.9777 


.9594 


.9593 


.9777 


(50, 100) 


.9977 


.9866 


.9975 


.9974 


.9857 


.9972 


(100,100) 


.9998 


.9999 


1.000 


.9998 


.9999 


1.000 




(10,10) 


.5144 


.5121 


.5817 


.5641 


.5610 


.5852 


(10,30) 


.8873 


.7833 


.8787 


.9172 


.7871 


.8892 


(10,50) 


.9353 


.8002 


.9528 


.9570 


.8116 


.9530 


(30,30) 


.9929 


.9915 


.9969 


.9929 


.9915 


.9971 


(30, 50) 


.9999 


.9979 


.9997 


.9999 


.9976 


.9998 


(50,50) 


.9999 


1.000 


1.000 


1.000 


1.000 


1.000 


(50, 100) 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


(100,100) 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 



Table 6: The empirical power estimates for the tests under the segregation alternatives, Hg, Hg^ , and Hg^^ 
for the two-class case with Nmc = 10000, for some combinations of ni,n2 G {10,30,50, 100} at the nominal 
level of a = 0.05. (*only is presented since (3^^ = = /3d- Asymptotic = with asymptotic critical 
values, Monte Carlo = with Monte Carlo critical values.) 



7.2 Empirical Power Analysis under Association Alternatives in the Two-Class 

Case 

For the association alternatives, we consider three cases. First, we generate Xi *~ W((0, 1) x (0, 1)) for i — 1, 2, . . . , ni. 
Then we generate Yj for j — 1,2, . . . ,n2 as follows. For each j, we pick an i randomly, then generate Y, as Xi + 

Rj {cos Tj, sin Tj)' where Rj '~ W(0, r) with r € (0,1) and T, W(0, 2 7r). In the pattern generated, appropriate 
choices of r will imply association between classes X and Y. That is, it will be more likely to have {X, Y) or (V, X) 
NN pairs than the same-class NN pairs (i.e., {X,X) or {Y,Y)). The three values of r we consider constitute the 
following three association alternatives; 

Hi-.r^ 1/4, H'a' -.r^ 1/7, and H'a" : r = 1/10. (9) 

Observe that, from H^, to H^' (i-e., as r decreases), the association between X and Y gets stronger in the sense 
that X and Y points tend to occur together more and more frequently. By construction, for similar sample sizes 
the association between X and Y are at about the same degree as association between Y and X. For very different 
samples, smaller sample is associated with the larger but the abundance of the larger sample confounds its association 
with the smaller. 

The empirical power estimates, based on the asymptotic critical values and Monte Carlo critical values, are 
presented in Table [7] Observe that when ni « n2, for both class-specific tests and Dixon's overall segregation test, as 
rii gets larger, the power estimates get larger; and as the association gets stronger, the power estimates get larger for 



each sample size combination. Furthermore, empirical power estimates based on Monte Carlo critical values tend to 
be larger for NN-class-specific tests, smaller for base-class-specific tests. 

When at least one class has a small size (i.e., rii < 10), the base-class-specific test for the smaller class, say class 
i, fails to detect any deviation from CSR independence for row i. But the base-class-specific test for the larger class 
is robust to the differences in relative abundance (i.e., sample sizes) and behaves as expected. Likewise, the NN-class- 
specific test is not affected by the differences in relative abundance as long as the smaller class is associated with the 
larger (the corresponding simulation results are not presented). In our set-up this corresponds to the case that class 
Y is much larger than class X. If on the contrary, the smaller class is associated with the larger (i.e., class X is much 
larger than class Y), then NN-class-specific test has very poor power performance. 

When sample sizes are similar, the base-class-specific tests for class Y (the class less associated with the other) 
have higher power under weak association, while NN-class-specific tests have higher power under strong association. 
When sample sizes are very different, the base-class-specific tests for classes Y and X have highest and lowest power 
estimates, respectively. Base-class-specific test for class i measures the association of other classes with class i, while 
NN-class-specific test for class j measures the association of class j with other classes. 



Empirical power estimates under the association alternatives 





SIZGS 


Asymptotic 


Monte Carlo 




(ni,n2) 


Pi 


P2 


(OiV i V ^ 

Pi 


Pi 


Pi 


Pi 


Hi 


(10,10) 


.1349 


.1776 


.1105 


.1352 


.1785 


.1282 


(10,30) 


.0002 


.4366 


.3007 


.0008 


.4451 


.3334 


(10,50) 


.0002 


.4947 


.3318 


.0010 


.5070 


.3435 


(30, 30) 


.1413 


.2434 


.1697 


.1388 


.2397 


.1795 


(30,50) 


.1833 


.3984 


.2903 


.1736 


.3903 


.3075 


(50,50) 


.1149 


.2421 


.1738 


.1153 


.2421 


.1729 


(50, 100) 


.1813 


.4411 


.3410 


.1398 


.4236 


.3230 


(100,100) 


.0853 


.2309 


.1677 


.0855 


.2310 


.1663 


Hi' 


(10,10) 


.2499 


.2569 


.1834 


.2500 


.2571 


.2010 


(10,30) 


.0000 


.6463 


.4956 


.0001 


.6550 


.5461 


(10,50) 


.0000 


.7062 


.5500 


.0004 


.7138 


.5591 


(30, 30) 


.4053 


.4457 


.4141 


.3987 


.4383 


.4298 


(30, 50) 


.4896 


.6957 


.6332 


.4661 


.6873 


.6559 


(50,50) 


.4034 


.4961 


.4616 


.4034 


.4961 


.4609 


(50, 100) 


.5527 


.7991 


.7559 


.4783 


.7858 


.7428 


(100,100) 


.3868 


.5575 


.5013 


.3868 


.5575 


.4987 




(10,10) 


.3038 


.2918 


.2222 


.3038 


.2920 


.2401 


(10,30) 


.0000 


.7364 


.6003 


.0000 


.7428 


.6480 


(10,50) 


.0000 


.7907 


.6512 


.0001 


.7983 


.6582 


(30,30) 


.6092 


.6011 


.6157 


.5991 


.5915 


.6289 


(30, 50) 


.7211 


.8491 


.8386 


.6880 


.8441 


.8535 


(50,50) 


.6842 


.6891 


.7285 


.6842 


.6891 


.7280 


(50, 100) 


.8024 


.9442 


.9433 


.7490 


.9376 


.9383 


(100,100) 


.7207 


.7973 


.8030 


.7207 


.7973 


.8011 



Table 7: The empirical power estimates for the tests under the association alternatives Hj^, H^^ , and H^^^ 
for the two-class case with Nmc — 10000, for some combinations of ni,n2 G {10,30,50, 100} at the nominal 
level of a = 0.05. (*only /3f ^ is presented since = /?^^ = Pd- Asymptotic — with asymptotic critical 
values, Monte Carlo = with Monte Carlo critical values.) 



8 Empirical Power Analysis in the Three- Class Case 



We consider three cases for each of segregation and association alternatives in the three-class case. 



8.1 Empirical Power Analysis under Segregation Alternatives in the Three-Class 

Case 



For the segregation alternatives, we generate Xi '~ U{(0, 1 — 2s) x (0, 1 — 2s)), Yj U{(2s, 1) x (2s, 1)), and Zi '~ 
U{{s, 1 — s) X (s, 1 — s)) for i = 1, . . . ,ni, j — 1, . . . , 712, and £ = 1, . . . ,ns. Notice the level of segregation is determined 
by the magnitude of s £ (0, 1/2). We consider the following three segregation alternatives: 

Hi: s = 1/12, ; s = 1/8, and : s = 1/6. (10) 

Observe that, from Hg to Hg (i.e., as s increases), the segregation gets stronger in the sense that X, Y, and Z 
points tend to form one-class clumps or clusters. Furthermore, under each segregation alternative, X and Y are more 
segregated compared to X and Z or Y and Z. 

The empirical power estimates using the asymptotic and Monte Carlo critical values are provided in Tables |8] and 
[9l respectively. Observe that, in both tables, at each sample size combination, as the segregation gets stronger, the 
power estimates get larger. Furthermore, the power estimates for the similar sample sizes tend to be larger compared 
to the different sample sizes when the total sizes n — (ni + n2 + ns) are similar. The power estimates also confirm 
that classes X and Y are more segregated compared to classes X and Z ox Y and Z. Furthermore, power estimates 
using the Monte Carlo critical values tend to be higher compared to the ones using the asymptotic critical values. In 
fact, based on Remark 16. II when at least one of the samples is small (i.e., Ui < 10) the estimates based on the Monte 
Carlo critical values would be more appropriate. For larger samples, asymptotic and Monte Carlo based empirical 
power estimates are both reliable. 

For large samples, Dixon's overall test tends to have the highest power estimates, but class-specific tests provide 
more information about the pattern. For all (small or large) similar relative abundance values, NN-class-specific tests 
have slightly better power performance, while for very different relative abundances, beise-class-specific tests have 
slightly better power performance. However, overall, we can conclude that in the three-class case, both base- and 
NN-class-specific tests and Dixon's overall test have about the same performance in terms of power. 

8.2 Empirical Power Analysis under Association Alternatives in the Three-Class 
Case 

For the association alternatives, we also consider three cases. First, we generate Xi ~ W((0, 1) x (0,1)) for i = 
1, 2, . . . , ni. Then we generate Yj and Ze for j — 1, 2, . . . , ?i2 and £ — 1,2, . . . ,ns as follows. For each j, we pick 

an i randomly, then generate Rj '~ W(0, Vy) with ry G (0, 1) and Tj '~ W(0, 2 vr) set Yj := Xi + Rj (cos Tj, sin Tj)' . 

Similarly, for each £, we pick an i randomly, then generate Rf W(0, r^) with £ (0, 1) and Ue '~ W(0, 2 n) and set 
Zi ■- X, + Rf (cos Ui, sin Ui)' . 

In the pattern generated, appropriate choices of Ty (and r^) values will imply association between classes X and 

Y (and X and Z). The three association alternatives are as 

H\:ry = 1/7, = 1/10, HI : ry = 1/10, r, = 1/20, H\:ry = 1/13, = 1/30. (11) 

Observe that, from H\ to H\ (i.e., as Ty decreases), the association between X and Y gets stronger in the sense that 

Y points tend to be found more and more frequently around the X points. The same holds for X and Z points as 
decreases. Furthermore, by construction, classes X and Z are more associated compared to classes X and Y . On the 
other hand, classes Y and Z are not associated, but perhaps are mildly segregated. 

We present the empirical power estimates using the asymptotic and Monte Carlo critical values in Tables [10] and 
111! respectively. Observe that for each sample size combination, as the association gets stronger, the power estimates 
tend to be higher. Based on Remark 16. II we use the power estimates based on the Monte Carlo critical values when at 
least one sample is small (i.e., Ui < 10). For larger samples, both asymptotic and Monte Carlo based empirical power 
estimates are reliable for comparative purposes. For all sample size combinations, NN-class-specific tests have better 
performance in terms of power in the sense that the highest power estimate for the three NN-class-specific tests tends 
to be larger than the highest power estimate for the three base-class-specific test at each sample size combination. 
Moreover, among the base-class-specific tests, the one for class 3 (i.e., class Z) has the highest power, while among 
the NN-class-specific tests, the one for class 1 (i.e., class X) has the highest power estimates. 

Remark 8.1. Main Result of Monte Carlo Power Analysis: Based on the recommendations made in Remark 
16.11 when at least one sample size is small (in the sense that some cell count is < 5), the power comparisons are more 
appropriate if the Monte Carlo critical values are used. For large samples, the power comparisons can be made using 
either the asymptotic or Monte Carlo critical values. In Sections 17.11 and 17.21 we observe that the power estimates 



using asymptotic and Monte Carlo critical values tend to be significantly different from each other when at least one 
of the samples is small, very similar when samples are larger. Under the segregation alternatives, for large samples 
with similar sizes, we conclude that NN-class-specific tests have higher power; when at least one sample is small, 
base-class-specific test for the smaller (larger) class has the highest (lowest) power estimates. Under the association 
alternatives, we observe that when at least one sample size is small, base-class-specific test for the smaller class (i.e., 
class X) has extremely poor performance, while for the larger class it has higher power than the NN-class-specific test. 
On the other hand, for larger samples, base-class-specific test for the largest (smallest) class has the highest (lowest) 
power. 

However base- and NN-class-specific tests answer different questions. So it is not appropriate to compare the base- 
and NN-class-specific tests. Thus, we recommend both of the base- and NN-class-specific tests to get more information 
on different aspects of the NN structure. □ 



9 Examples 

We illustrate the tests on three example data sets: an ecological data set (swamp tree data), an epidemiological data 
set (leukemia data), and an MRI data set (pyramidal neuron data). 

9.1 Swamp Tree Data 

iGood and Whipple! (|l982'') considered the spatial patterns of tree species along the Savannah River, South Carolina, 
U.S.A. From this data, iDixon (,,2002bi ') used a single 50m x 200m rectangular plot to illustrate NNCT methods. All live 
or dead trees with 4.5 cm or more dbh (diameter at breast height) were recorded together with their species. Hence 
it is an example of a realization of a marked multi-variate point pattern. The plot contains 13 different tree species, 
four of which comprises over 90 % of the 734 tree stems. The remaining tree stems were categorized as "other trees" . 
The plot consists of 215 water tupelo {Nyssa aquatica), 205 black gum {Nyssa sylvatica), 156 Carolina ash {Fraxinus 
caroliniana) , 98 bald cypress {Taxodium distichum), and 60 stems of 8 additional species (i.e., other species). A 5 x 5 
NNCT-analysis is conducted for this data set. If segregation among the less frequent species were important, a more 
detailed 13 x 13 NNCT-analysis should be performed. The locations of these trees in the study region are plotted in 
Figure [3] and the corresponding 5x5 NNCT together with percentages based on row and column sums are provided 
in Table [12] For example, for black gum as the base species and Carolina ash as the NN species, the cell count is 26 
which is 13 % of the 205 black gums (which is 28 % of all trees), and 15 % of the 171 times Carolina ashes serves as 
NN (which is 23 % of all trees). Observe that the percentages and Figure [3] are suggestive of segregation for all tree 
species, especially for Carolina ashes, water tupelos, black gums, and the "other" trees since the observed percentage 
of species with themselves as the NN is much larger than the marginal (row or column) percentages. 
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Figure 3: The seatter plot of the locations of water tupelos (triangles A), black gum trees (pluses -f ), Carolina 
ashes (crosses x), bald cypress trees (diamonds o), and other trees (inverse triangles V). 
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Figure 4: Second-order properties of swamp tree data. Functions plotted arc Ripley's univariate L-functions 
Lii{t) — t for J = 0, 1, . . . , 5, where i = stands for all data combined, i = 1 for water tupelos, i = 2 for black 
gums, i = 3 for Carolina ashes, i = 4 for bald cypresses, and j = 5 for other trees. Wide dashed lines around 
are the upper and lower 95 % confidence bounds for the L-functions based on Monte Carlo simulation under 
the CSR independence pattern. 



The locations of the tree species can be viewed a priori resulting from different processes so the more appropriate 
null hypothesis is the CSR independence pattern. Hence our inference will be a conditional one (see Remark 14. ip . We 
calculate Q = 472 and R = 454 for this data set. We present the overall test of segregation, class-specific test statistics 
and the associated p-values in Table where pasy stands for the p-value based on the asymptotic approximation (i.e., 
the corresponding distribution) , pmc is the p- value based on 10000 Monte Carlo replication of the CSR independence 
pattern in the same plot, and Prand is based on Monte Carlo randomization of the labels on the given locations of 
the trees 10000 times. Observe that Pasy, Pmc, and Prand are similar for each test. Overall test of segregation is 
significant implying significant deviation from the CSR independence pattern for at least one pair of tree species. 
Base-class-specific tests are all significant for all species but bald cypresses implying significant deviation in rows than 
expected unde r CSR independence, except for bald cypress trees. These findings are in agreement with the results of 
(|Dixonl (|2002bl )). NN-class-specific tests are significant for all species but bald cypresses, implying significant deviation 
in columns than expected under CSR independence for all species except for bald cypress trees. Hence except for bald 
cypresses, each tree species seem to result from a (perhaps) different first order inhomogeneous Poisson process. 

Based on the NNCT-tests above, we conclude that tree species exhibit significant deviation from the CSR inde- 
pendence pattern, except for bald cypresses. Considering Figure |3] and the corresponding NNCT in Table 1121 this 
deviation is toward the segregation of the species. Then, we might also be interested in the causes of the segregation 
and the type and level of interaction between the tree species at different scales (i.e., distances between the trees). 
To answer such questions, we also present the second-order analysis of the swamp tree data. We calculate Ripley's 

(univariate) L- function which is the modified version of K function as La (t) — ^ (^Kii{t)/Tr^ where t is the distance 

from a randomly chosen event (i.e., location of a tree). Knit) is an estimator of 

K{t) — A^^E[# of extra events within distance t of a randomly chosen event] (12) 

with A being the density (number per unit area) of events and is calculated as 



i^>i(t) = A-'^^w(Z„/j)I(d,j < t)/N 



(13) 



where A = N/A is an estimate of density (A'^ is the observed number of points and A is the area of the study region), 
dij is the distance between points i and j, I(-) is the indicator function, w{li, Ij) is the proportion of tlie circumference 
of the circle centered at U with radius dij that falls in the study area, which corrects for the boundary effects. Under 
CSR, L{t) — t = holds. If the univariate pattern exhibits aggregation, then L{t) — t tends to be positive, if it exhibits 
regularity then L{t) — t tends to be negative. The estimator K{t) is approximately unbiased for K{t) at each fixed 
t. Bias depends on the geometry of the study area and increases with t. For a rectang ular region it is recommended 
to use t values up to 1/4 of the smaller side length of the rectangle. See (|Digglel (|2003h ) for more detail. So we take 
the values t £ [0, 12.5] in our analysis, since the smaller side of the rectangular region of swamp tree data is 50 m. In 
Figure |4j we present the plots of Lii{t) — t functions for each species as well as the plot of all trees combined. We 
also present the upper and lower (pointwise) 95 % confidence bounds for each Lii{t) — t. Observe that for all trees 
combined there is significant aggregation of trees (the Loo{t) — t curve is above the upper confidence bound) at all 
scales (i.e., distances). Water tupelos exhibit significant aggregation for the range of the plotted distances; black gums 
exhibit significant aggregation for distances t > 1 m; Carolina ashes exhibit significant aggregation for the range of 
plotted distances; bald cypresses exhibit no deviation from CSR independence for f < 5 m, then they exhibit significant 
spatial aggregation for t > 4 m; other trees exhibit significant aggregation for the range of plotted distances. Hence, 
segregation of the species might be due to different levels and types of aggregation of the species in the study region. 

We also calculate Ripley's bivariate L-function as Lij{t) = (^Kij {t) /irj where Kij{t) is an estimator of 

Kijit) = A~^E[7^ of extra type j events within distance f of a randomly chosen type i event] 
with \j being the density of type j events and is calculated as 

i 3 

where rfi^.j, is the distance between fc"" type i and l^^ type j points, w{ik,ji) is the proportion of the circumference of 
the circle centered at fc"' type i point with radius di^.j, that falls in the study area, which is used for edge correction. 
Under CSR independence, Lij{t) — t = holds. If the bivariate patte rn is se g regat ion, then Lij{t) — t tends to be 
negative, if it is association then Lij{t) ~ t tends to be positive. See (|Digglel (|2003l )) for more detail. In Figure [S] 
we present the bivariate plots of Lij{t) — t functions together with the upper and lower (pointwise) 95 % confidence 
bounds for each pair of species (due to the symmetry of Lij{t) there are only 10 different pairs). Observe that for 
distances up to t « 10 m, water tupelos and black gums exhibit significant segregation [L\2{t) — t is below the lower 
confidence bound) and for the rest of the plotted distances their interaction is not significantly different from the CSR 
independence pattern; water tupelos and Carolina ashes are significantly segregated up to about t « 10 m; water 
tupelos and bald cypresses do not have significant deviation from the CSR independence pattern for distances up to 
4 m, for larger distances they exhibit significant segregation; water tupelos and the other trees do not deviate from 
CSR independence for the range of the plotted distances. Black gums and Carolina ashes are significantly segregated 
for t > 2 m; black gums and bald cypresses are significantly segregated for t > 2 m; black gum and other trees are 
significantly segregated for all the distances plotted. Carolina ashes and bald cypresses are significantly associated for 
distances larger than 3 m; and Carolina ashes and the other trees exhibit significant segregation for 3 < t < 7 m and 
for f > 11 m they exhibit significant association. On the other hand, bald cypresses and other trees are significantly 
associated for distance larger than 4 m. 

But Ripley's _fL-fu nction is cumulative, so interpreting the spatial interaction at larger dista nces is pro blematic 
i Wiegand et^\2m% ). The (accumulative) pair correlation function g{t) is better for this purpose (jStovan and Stovanl 
1994f )). The pair correlation function of a (univariate) stationary point process is defined as 



where K'{t) is the derivative of K{t). For a univariate stationary Poisson process, g{t) = 1; values of g{t) < 1 
suggest inhibition (or regularity) between points; and values of g{t) > 1 suggest clustering (or aggregation). The pair 
correlation functions for all trees and each species for the swamp tree data are plotted in Figure [G] Observe that all 
trees are aggregated around distance values of 0-1,3,4,5,7,9-10 m; water tupelos are aggregated for distance values of 
0-4 and 5-7 m; black gums are aggregated for distance values of 1-6 and 8-11 m; Carolina ashes are aggregated for all 
the range of the plotted distances; bald cypresses are aggregated for distance values of 2-8 and around 11m; and other 
trees are aggregated for all distance values except 3-5 m. Comparing Figures |4] and [G] we see that Ripley's L and pair 
correlation functions detect the same patterns but with different distance values. That is, Ripley's L implies that the 
particular pattern is significant for a wider range of distance values compare d to g(t), since Ripley's L is cumulative, 
so the values of L at small scales confound the values of L at larger scales (|Loosmore and FordI (|2006l Vl. Hence the 
results based on pair correlation function g{t) are more reliable. 



The same definition of tlie pair correlation function can be applied to Ripley's bivariate K or L-functions as well. 
The benchmark value of Kij{t) — nt'^ corresponds to g{t) — 1; g{t) < 1 suggests segregation of the classes; and 
g{t) > 1 suggests association of the classes. The bivariate pair correlation functions for the species in swamp tree data 
are plotted in Figure [7] Observe that water tupelos and black gums are segregated for distance values of 0-1 m; water 
tupelos and Carolina ashes are segregated for values of 0-1 and 2.5 m and are associated for values about 6 m; water 
tupelos and bald cypresses are segregated for 0-1, 5.5, 9.5, and 11 m and are associated for 6.5 m; water tupelos and 
other trees are segregated for 0-0.5 and 7 m and are associated for 8 m; black gums and Carolina ashes are segregated 
for 2-2.5, 3.5-4.5, 6-8.5, and 9.5-12 m; black gums and bald cypresses are segregated for 3.5, 5.5-6.5,7, and 9.5 m; black 
gums and other trees are segregated for 5 and 6-7.5 m; Carolina ashes and bald cypresses are associated for 1.5-3, 
5.5., and 7 m; Carolina ashes and other trees are associated for 5 and 9-10 m; and bald cypresses and other trees are 
segregated for 4 m and are associated for 3-4 and 6.5-7.5 m. 

However the pair correlation function estimates might have critical behavior for small t if g{t) > 0, since the 
estimator variance and he nce the bias are considerably large. This problem gets worse especially in cluster processes 
l|Stovan and StovanI (| 19961 )). See for example Figures [S] and [7] where the confidence bands for smaller t values are much 
wider compared to those for larger t values. So pair correlation function analysis is more reliable for larger distances 
and it is safer to use g{t) for distances larger than the average NN distance in the data set. Comparing Figure |4] 
with Figure |S] and Figure [S] with Figure [7] we see that Ripley's L and pair correlation functions usually detect the 
same large-scale pattern but at different ranges of distance values. Ripley's L suggests that the particular pattern is 
significant for a wider range of distance values compar ed to g(t), since values of L at small scales confound the values 
of L at larger scales where g{t) is more reliable to use (|Loosmore and FordI (|2006t )). 

While second order analysis (using Ripley's K and L-functions or pair correlation function) provides information on 
the univariate and bivariate patterns at all scales (i.e., for all distances), NNCT-tests summarize the spatial interaction 
for the smaller scales (for distances about the average NN distance in the data set). In particular, for the swamp tree 
data average NN distance (± standard deviation) is about 1.8 (± 1.04) meters and notice that Ripley's L-function and 
NNCT-tests yield similar results for distances about 2 meters. It is interesting to observe that both methods indicate 
that bald cypresses have a very different pattern compared to the others. 

9.2 Leukemia Data 

ICuzick and Edwardsl (|l990l 'l considered the spatial locations of 62 cases of childhood leukemia in the North Humberside 
region of the UK between the years 1974 to 1982 (inclusive). A sample of 143 controls are selected using the completely 
randomized design from the same region. We analyze the spatial distribution of leukemia cases and controls in this data 
using a 2 X 2 NNCT. We plot the locations of these points in the study region in Figure[S]and provide the corresponding 
NNCT together with percentages based on row and column sums in Table 1141 Observe that the percentages in the 
diagonal cells are about the same as the marginal (row or column) percentages of the subjects in the study, which 
might be interpreted as the lack of any deviation from the null case for both classes. Figure [3] is also supportive of 
this observation. 

We can assume that some processes affect a posteriori the population of North Humberside region so that some 
of the individuals get to be cases, while others continue to be healthy (i.e., they are controls). So the appropriate 
null hypothesis is the RL pattern. We calculate Q = 152 and R = 142 for this data set. In Table 1151 we present 
the overall test of segregation, class-specific test statistics and the associated p-values (i.e., p-values based on the 
asymptotic approximation, Monte Carlo simulation, and Monte Carlo randomization methods). Observe that Pasy, 
Pmc, and Prand are similar for each test and none of the tests yields a significant result. 
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Table 8: The empirical power estimates for the tests under the segregation alternatives, Hg, Hg, and Hg 
for the three-class case with Nmc = 10000, for some combinations of ni, n2 G {10, 30, 50, 100} at the nominal 
level of a = 0.05. 
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Table 9: The empirical power estimates for the tests under the segregation alternatives, Hg, Hg, and Hg 
for the three-class case with Nmc = 10000, for some combinations of ni, n2 G {10, 30, 50, 100} at the nominal 
level of a = 0.05. 
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Table 10: The empirical power estimates for the tests under the association alternatives, H\, H\, and H\ 
for the three-class case with Nmc — 10000, for some combinations of rii, 712 G {10, 30, 50, 100} at the nominal 
level of a = 0.05. 
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.9914 


.5375 


.9984 
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.8928 


.9662 


1.000 




(10,10,10) 


.2350 


.1774 


.5875 


.6110 


.2453 


.1942 


.4980 




(10,10,30) 


.0590 


.0730 


.8597 


.8005 


.2533 


.2253 


.7381 




(10,10,50) 


.0016 


.0543 


.8941 


.7721 


.2314 


.1426 


.7223 




(10,30,30) 


.1529 


.1055 


.8917 


.8171 


.6119 


.1956 


.8411 




(10,30,50) 


.0872 


.0576 


.9343 


.7750 


.6337 


.1776 


.8427 




(10,50,50) 


.1332 


.0505 


.9291 


.7379 


.7285 


.2814 


.8546 




(30,30,30) 


.8579 


.3995 


.9558 


.9872 


.7610 


.6843 


.9739 




(30,30,50) 


.8899 


.2631 


.9945 


.9973 


.7634 


.8087 


.9947 




(30,50,50) 


.8926 


.4000 


.9965 


.9984 


.9078 


.7132 


.9982 




(50,50,50) 


.9864 


.5333 


.9972 


.9996 


.9285 


.9230 


.9993 




(50,50,100) 


.9887 


.3094 


1.000 


1.000 


.9248 


.9790 


1.000 




(50,100,100) 


.9873 


.4196 


1.000 


1.000 


.9907 


.9242 


1.000 




(100,100,100) 


.9999 


.6639 


1.000 


1.000 


.9928 


.9972 


1.000 



Table 11: The empirical power estimates for the tests under the association alternatives, H\, H\, and H\ 
for the three-class case with Nmc — 10000, for some combinations of rii, n2 £ {10, 30, 50, 100} at the nominal 
level of a = 0.05. 



NN 





W.T. 


B.C. 


C.A. 


B.C. 


O.T. 


sum 


W.T. 


112 


40 


29 


20 


14 


215 


B.C. 


38 


117 


26 


16 


8 


205 


base C.A. 


23 


23 


82 


22 


6 


156 


B.C. 


19 


29 


29 


14 


7 


98 


O.T. 


7 


8 


5 


7 


33 


60 


sum 


199 


217 


171 


79 


68 


734 








NN 










W.T. 


B.C. 


C.A. 


B.C. 


O.T. 


sum 


W.T. 


52 % 


19 % 


13 % 


9 % 


7 % 


29 % 


B.C. 


19 % 


57% 


13 % 


8 % 


4 % 


28 % 


base C.A. 


15 % 


15 % 


53 % 


14 % 


4 % 


21 % 


B.C. 


19 % 


30 % 


30 % 


14 % 


7 % 


13 % 


O.T. 


12 % 


13 % 


8 % 


12 % 


55 % 


8 % 




Table 12: The NNCT for swamp tree data (top), the percentages of each row, i.e., the percentages of tree 
species serving as NN to each tree species (middle) , and the percentages of each column, i.e, the percentages 
of tree species serving as base to each tree species (bottom). W.T. = water tupelo, B.C. = black gum, C.A. 
= Carolina ash, B.C. = bald cypress, and O.T. = other tree species. 



Test statistics and p-values for se 


gregation tests 




X'^-statistic 


Pasy 


Pmc 


Prand 


overall 


275.64 


< .0001 


< .0001 


< .0001 


W.T. 


42.27 


< .0001 


< .0001 


< .0001 


B.C. 


65.13 


< .0001 


< .0001 


< .0001 


base C.A. 


70.99 


< .0001 


< .0001 


< .0001 


B.C. 


7.09 


.1313 


.1315 


.1291 


O.T. 


117.48 


< .0001 


< .0001 


< .0001 


W.T. 


61.37 


< .0001 


< .0001 


< .0001 


B.C. 


75.96 


< .0001 


< .0001 


< .0001 


NN C.A. 


81.06 


< .0001 


< .0001 


< .0001 


B.C. 


10.73 


.0571 


.0611 


.0518 


O.T. 


118.23 


< .0001 


< .0001 


< .0001 



Table 13: The x^-statistics for the segregation tests and the corresponding p-values. Pasy stands for the 
p-value based on the asymptotic approximation, pmc is the p-value based on 10000 Monte Carlo replication 
of the CSR independence pattern in the same region, and Prand is based on Monte Carlo randomization of 
the labels on the given locations of the trees 10000 times. W.T. = water tupelo, B.C. = black gum, C.A. = 
Carolina ash, B.C. = bald cypress, and O.T. = other tree species. 



NN 

case control 


sum 


NN 

case control 


sum 


NN 

case control 




1 case 
base 

control 


25 41 
39 113 


66 case 
152 control 


38 % 62 % 
26 % 74 % 


30 % case 
70 % control 


39 % 27 % 
61 % 73 % 




sum 


64 154 


218 




sum 


29 % 71 % 





Tabic 14: The NNCT for the leukemia data (left) and the corresponding percentages (right). 
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Figure 5: Second-order properties of swamp tree data. Functions plotted are Ripley's bivariate L- functions 
Lij{t) — t for i, j = 1, 2, . . . , 5 and i ^ j where i = stands for all data combined, i = 1 for water tupelos, 
i = 2 for black gums, i = 3 for Carolina ashes, i = 4 for bald cypresses, and i = 5 for other trees. Wide 
dashed lines around are the upper and lower 95 % confidence bounds for the L-functions based on Monte 
Carlo simulations under the CSR independence pattern. W.T. = water tupelo, B.G. = black gum, C.A. = 
Carolina ash, B.C. = bald cypress, and O.T. = other tree species. 
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Figure 6: Pair correlation functions for all trees combined and for each species in the swamp tree data. Wide 
dashed lines around 1 (which is the theoretical value) are the upper and lower (pointwise) 95 % confidence 
bounds for the L-functions based on Monte Carlo simulation under the CSR independence pattern. 



Test statistics and p- values for segregation tests 




X'^-statistic 


Pasy 


Pmc 


Prand 


overall 


2.25 


.3249 


.3303 


.3519 


1 case 
base 

control 


1.44 


.2293 


.2266 


.2389 


1.65 


.1995 


.2041 


.2237 


NN ^^^^ 
control 


2.25 


.3249 


.3303 


.3519 


2.25 


.3249 


.3303 


.3519 



Table 15: The x^-statistics for the segregation tests and the corresponding p-valucs. 
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Figure 7: Pair correlation functions for each pair of species in the swamp tree data. Wide dashed lines 
around 1 (which is the theoretical value) are the upper and lower (pointwise) 95 % confidence bounds for the 
L-functions based on Monte Carlo simulations under the CSR independence pattern. W.T. = water tupelos, 
B.G. = black gums, C.A. = Carolina ashes, B.C. = bald cypresses, and O.T. = other tree species. 



Leukemia Data 




Figure 8: The scatter plot of the locations of controls (circles o) and childhood leukemia cases (triangles A) 
in North Humberside, UK. 




Figure 9: Second-order analysis of North Humberside childhood leukemia data: Function plotted is Biggie's 
modified bivariate X-function D{t) = Kii{t) — K22{t) with i = f for controls and i = 2 for leukemia cases. 
Wide dashed lines around are plus and minus two standard errors of D{t) under RL of cases and controls. 

Based on the NNCT-tests above, we conclude that the cases and controls do not exhibit significant clustering (i.e., 
segregation). However, NNCT-methods only provide information on spatial interaction for distances about expected 
NN distance in the data set, so it might be the case that the type and level of interaction might still be different at 
larger scales (i.e., distances between the subjects' locations). However, the locations of the subjects in this population 
(cases and controls together) seem to be from an inhomogeneous Poisson process (see also Figure[8]). Hence Ripley's K - 
or L-func tions in the g eneral form are not appropriate to test for the spatial clustering of the cases (iKuUdorfil (|2006l )). 
However. iDigglel (|2003 ) suggests a version based on Ripley's univariate A"- functions as D{t) — Kii{t) — 7^22 (t) where 
Knit) is defined in Equation (|12|l . In this setup, "no spatial clustering" is equivalent to RL of cases and controls on the 
locations in the sample, which implies D{t) — 0, since K22{t) measures the degree of spatial aggregation of the controls 
(i.e., the population at risk), while K\i{t) measures this same spatial aggregation plus any additional clustering due 
to the disease. The test statistic D{t) is estimated by D{t) = K\\{t) — 7^22 (t), where Knit) is as in Equation (|13p. 
Figure [9] shows the plot of D{t) plus and minus two standard errors under RL. Observe that at distances about 200 
and 600 meters, there is evidence for mild clustering of diseases (i.e., segregation of cases from controls) since the 
empirical function D{t) gets close or a little above of the upper limit. At smaller scales, plot in Figure [2]is consistent 
with the results of the NNCT analysis. In particular average NN distance for leukemia data is 700 (± 1400) m, and 
NNCT analysis s ummari z es th e pattern for about t — 1000 m which is depicted in Figure [O] This same data set was 
also analyzed by (|Digglel (|2003l ') pp 131-132) and similar plots and results were obtained. 



9.3 Pyramidal Neuron Data 

This data set gives the (a:, j/)-coordinates of pyramidal neurons in area 24, layer 2 of the cingulate cortex. The data are 
taken from a unit square region (unit of measurement unknown) in each of 31 subjects, grouped as follows: controls 
consists of 12 subjects and correspond to cell numbers 1-655, schizoaffectives consists of 9 subjects and correspond cell 
numbers 656-1061, and schizophrenics consists of 10 subjects and correspond cell numbers 1062-1400. Controls are 
the subjects with no previous history of any mental disorder, schizoaffective disorder is a psychiatric disorder where 
both the symptoms of mood disorder and psychosis occur, and schizophr enia is a psychotic disorder characterized by 
severely impaired thinking, emotions, and behavior. iDiggle et al.l (|l99ll ) applied several methods for the analysis of 
the spatial distributions of pyramidal neurons in the cingulate cortex of human subjects in three diagnostic groupings. 
With a scaled Poisson analysis they found significant differences between the groups in the mean numbers of neurons 
in the sampled region, as well as a high degree of extra-Poisson variation in the distribution of cell counts within 
these groups. They employed two different functional descriptors of spatial pattern for each subject to investigate 
departures from completely random patterns, both between subjects and between groups, while adjusting for cell 
count differences. Since the distributions of their main functional pattern descriptor and of their derived test statistic 
are unknown, they applied a bootstrap procedure to attach p-values to their findings. 



Since the definition of the rectangular domain for identifying neuron positions is independent of neuronal cell density 



or the pattern and this samphng domain is almost identical for each subject, iDiggle et all dlQQll l merged (pooled) the 
data for each group. That is, the pyramidal neuron locations from control subjects were pooled into one group, from 
schizoaffective into another, and schizophrenic into another. Although, the spatial distributions between subjects are 
not th e same, we think pooling the data by group might reveal more than what might be concealed. iDiggle et al.l 
|l993) computed and compared Ripley's univariate K functions to detect differences between patterns across the 
three groups. Pattern analysis of the cellular arrangements demonstrated significant departures from complete spatial 
randomness in favor of spatial regularity for each group, in particular for schizophrenics. On the pooled data, we also 
apply multivariate pattern tests. In particular, we apply a 3 x 3 NNCT-analysis and Ripley's bivariate L-functions (in 
addition to the univariate L-functions). 

We plot the locations of these points in the study region in Figure[lO]and provide the corresponding NNCT together 
with percentages based on row and column sums in Table [TUl Observe that the pyramidal neuron locations appear to 
be somewhat regularly spaced, especially for schizophrenics. Also the percentages are slightly smaller for the diagonal 
cells, especially for the controls, compared to the marginal (row or column) percentages, which might be interpreted 
as presence of a deviation from CSR independence in favor of association. 



Pyramidal Neuron Data 
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Figure 10: The scatter plots of the locations of controls (circles 
schizophrenics (pluses +). 



)), schizoafFectives (triangles A), and 



NN 

Ctrl S.A. Sch. 


sum 


NN 

Ctrl S.A. Sch. 


sum 


, Ctrl 
base ^ . 
S.A. 

Sch. 


271 216 171 
212 107 89 
175 89 75 


658 Ctrl 
408 S.A. 
339 Sch. 


41 % 33 % 26 % 
52 % 26 % 22 % 
52 % 26 % 22 % 


47% 
29 % 

24 % 


sum 


658 412 335 


1405 








Table 16: The NNCT for the pyramidal neuron data (left) and the corresponding percentages (middle and 
right). Ctrl = Control, S.A. — Schizoaffective, and Sch. — Schizophrenics. 



The locations of the pyramidal neurons can be viewed a priori resulting from different processes, so the more 
appropriate null hypothesis is the CSR independence pattern. Hence our inference will be a conditional one (see 
Remark I4.1|l . We calculate Q = 888 and R — 892 for this data set. In Table [TTl we present the overall test of 
segregation, class-specific test statistics and the associated p-values based on the asymptotic approximation, Monte 
Carlo simulation, and Monte Carlo randomization. Observe that Pasy, Pmc, and Prand are similar for each test. The 



Test statistics and p- values for sc 


jregation tests 




X'^-statistic 


Pasy 


Pmc 


Prand 


overall 


9.91 


.1283 


.1271 


0.1280 


Ctrl 


7.43 


.0243 


.0267 


.0230 


base S.A. 


4.19 


.1229 


.1208 


.1233 


Sch. 


3.12 


.2098 


.2104 


.2101 


Ctrl 


9.57 


.0226 


.0229 


.0241 


NN S.A. 


6.36 


.0953 


.0968 


.0936 


Sch. 


2.91 


.4060 


.4052 


.4058 



Table 17: The x^-statistics for the segregation tests and the corresponding p- values (in parentheses). Ctrl = 
Control, S.A. = Schizoaffective, and Sch. = Schizophrenics. 

overall test of segregation is not significant; however, base- and NN-class-specific tests are significant for controls only, 
implying significant deviation in row 1 and column 1 (i.e., row and column for controls) than expected under CSR 
independence. Based on the NNCT for this data set, we observe that th e deviation is toward association of controls 
with schizoaffectives and vice versa. As mentioned in lPiggle et al.1 (|l99ll ). the locations of the pyramidal neurons for 
controls and schizoaffectives are more similar than those of schizophrenics. Thus, it is possible to support to the idea 
that alterations of cortical function may occur in schizophrenics, which might agree with an earlier suggestion that 
neuronal numbers are lower in the cerebral cortex of schizophrenic patients. 

Based on the NNCT-tests above, we conclude that the pyramidal neurons exhibit significant deviation from the 
CSR independence pattern for the control subjects toward the association of controls and schizoaffectives. Then, one 
might want to know what might be causing the association, and what is the type and level of interaction at different 
scales (i.e., distances between the neurons). To answer such questions, we also present the second-order analysis of 
the pyramidal neuron locations. We plot Ripley's (univariate) L-function for all data combined and for each (pooled) 
group in Figure [TT] where the upper and lower 95 % confidence bounds are also provided. Observe that for all neurons 
combined there is significant aggregation of locations (the ioo(^) — t curve is above the upper confidence bound) at 
distances t > 0.18. For small scales (t « 0.05), controls and schizophrenics exhibit significant regularity. At distances 
t > 0.20, neurons of schizoaffectives tend to be aggregated, while at distances t > 0.15 neurons of controls tend to be 
aggregated, while for distances up to 0.3, neurons of schizophrenics exhibit regularity. This is along the lines of the 
NNCT analysis results, which indicate deviation from CSR independence at smaller scales. The less spatial regularity 
of the schizoaffectives compared to the most regular pattern of schizophrenics might explain the association of neurons 
of controls and schizoaffectives. 

We also plot Ripley's bivariate L-function together with the upper and lower 95 % confidence bounds in Figure [121 
Observe that for almost all distances controls and schizoaffectives exhibit significant spatial association, while controls 
and schizophrenics exhibit significant spatial association for distances t ~ 0.04, and schizoaffectives and schizophrenics 
show significant association for distances t « 0.04 also. The average NN distance for this data set is 0.014 (± 0.007), 
so at smaller scales (i.e., t < 0.02) the univariate and bivariate L-functions seem to be in agreement with the NNCT 
results which indicate the association of controls and schizophrenics. 

Since Ripley's 7^-function is cumulative, we also present the pair correlation functions for all neurons and each 
group of neurons in Figure [T51 Observe that all neurons are aggregated around distance values of 0.04, 0.09, 0.16, and 
0.20; control neurons are aggregated for distance values of 0.1, 0.17, and 0.25; schizoaffective neurons are aggregated 
for distance values of 0.08, 0.10, 0.12, 0.16, and 0.20; and schizophrenic neurons exhibit regularity for distance values 
of 0.025. Comparing Figures [TTI and [T51 we see that Ripley's L and pair correlation functions detect the same patterns 
but with different distance values, so that Ripley's L implies that the particular pattern is significant for a wider range 
of distance values compared to g{t). But the results based on pair correlation function g{t) are more reliable. 

The bivariate pair correlation functions for the groups in pyramidal neuron tree data are plotted in Figure 1141 
Observe that control and schizoaffective neurons are associated for distance values of 0.04, 0.08, and 0.15; control 
and schizophrenic neurons do not significantly deviate from CSR independence for the considered range of distances, 
neither do the schizoaffective and schizophrenic neurons. Comparing Figures [TJ] and [TJ] we see that Ripley's L and 
pair correlation functions usually detect the same large-scale pattern but at narrower ranges of distance values for the 
latter. 
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Figure 11: Second-order properties of pyramidal neuron cells. Functions plotted are Ripley's univariate 
L-functions Lii{t) — t (left) for j = 0, 1, 2, 3 i = stands for all data combined, i = 1 for controls, i = 2 
for schizoaffectlves and i = 'i for schizophrenics. Wide dashed lines around are the upper and lower 95 
% confidence bounds for the L-functions based on Monte Carlo simulations under the CSR independence 
pattern. 

10 Discussion and Conclusions 

In this article we consider (Dixon's) overall and base-class-specific tests and introduce NN-class-specific tests of seg- 
regation based on NNCTs. We discuss the differences in these NNCT-tests, present the asymptotic properties of 
them, compare them using extensive Monte Carlo simulations under CSR independence and RL and under various 
segregation and associatio n alternatives for two- and three-class cases. We als o illustrate the tests and compare them 
with Ripley's L-fun c tion jRiplevI (|2004l )). Biggie's D-function (|Digglel (|2003l ') p. 131), and pair correlation function 
(|Stovan and StovanI (| 19941 )') on three data sets. 

NNCT-tests (i.e., overall and class-specific tests of segregation) are used in testing various forms of randomness 
in the NN structure between two or more classes. The overall test is used for testing any deviation from randomness 
in the cells of the NNCT; base-class-specific tests are used for testing any deviation from randomness in the NN 
distribution of the base class in question (i.e., any deviation in the corresponding row); NN-class-specific tests are 
used for testing any deviation from randomness in the base distribution of the classes to which the class in question 
serves as NN (i.e., any deviation in the corresponding column). The randomness in the NN structure is implied by the 
CSR independence or RL patterns. However, we demonstrate that under the CSR independence pattern, NNCT-tests 
are conditional on Q and R, while under the RL pattern, they are unconditional tests. In the two-class case, both 
NN-class-specific tests and the overall test yield identical results. Furthermore the three types of the NNCT-tests we 
consider are all consistent under their respective null hypotheses. 

The CSR independence pattern assumes that the study region is unbounded for the analyzed pattern, which is 
not the case in practice. So the edge (or boundary) effects might confound the test results in the analysis of empirical 
(i.e., bounded) data sets if the null pattern is CSR in dependence and much effort has gone into the development of 
edge correction methods (|Yamada and RogersenI (|2003l )h Two correction m ethods for the edge eff ects on NNCT-tests, 
namely buffer zone correction and toroidal correction, are investigated in (|Cevhanl (|2007l . l2008al )) where it is shown 
that the empirical sizes of the NNCT-tests are not affected by the toroidal edge correction under CSR independence. 
However, toroidal correction is biased for non-CSR patterns. In particular if the pattern outside the plot (which is 
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Figure 12: Second-order properties of pyramidal neuron cells. Functions plotted are Ripley's bivariate L- 
functions Lij{t) — t (right) for i ^ j i = I for controls, i ~ 2 for schizoaffectives and i ~ 3 for schizophrenics. 
Wide dashed lines around are the upper and lower 95 % confidence bounds for the L-functions based on 
Monte Carlo simulations under the CSR independence pattern. 

often unknown) is not the same as that inside it it yields questionable results (|Haasel (|l995l ) and lYamada and RogersenI 
The bias is more severe especially when there are clusters around the edges. Under CSR independence, the 
(outer) buffer zone edge correction method seems to have slightly stronger influence on the tests compared to toroidal 
correction. But for these tests, buffer zone correction does not ch a nge t he sizes significantly for most sample size 
combinations. This is in agreement with the findings of iBarot" et all (jl999l l who say that NN methods only require a 
small buffer area around the study region. A large buffer area does not help much since one only needs to be able 
to see far enough away from an event to find its NN. Once the buffer area extends past the likely NN distances (i.e., 
about the average NN distances), it is not adding much helpful information for NNCTs. Hence we recommend inner 
or outer buffer zone correction for NNCT-tests with the width of the buffer area being about the average NN distance. 
We do not recommend larger buffer areas, since they are wasteful with little additional gain. On the other hand, we 
recommend the use of toroidal edge correction with points within the average NN distance in the additional copies 
around the study region when there are no clusters around the edges and it is reasonable to assume the replicated 
structure for the pattern. For larger distance s, the ga i n mig ht not be worth the effort. Empirical edge corrections 
are available for other nearest neighbor tests (ISinclaij (Il985l\b They are potentially applicable to the NNCT-tests, 
however the buffer zone correction analysis in (|Cevhanl (20^)) suggests that such a correction will not make much 
difference. 

Based on our Monte Carlo simulations, we conclude that the asymptotic approximation for the NNCT-tests 
is appropriate only when each cell count in the NNCT is larger than 5; when at least one cell count is < 5, we 
recommend the Monte Carlo randomization version of the tests. For large samples so that each cell count is larger 
than 5, the NNCT-tests have similar empirical significance levels which are about the nominal level. When at least 
one sample size is small so that some cell count (s) are < 5, we compare the power of the tests using Monte Carlo 
critical values. For large samples, the power comparisons can be made using both the asymptotic or Monte Carlo 
critical values. Under the segregation alternatives, for large samples, we conclude that NN-class-specific tests have 
higher power; when at least one sample is small, base-class-specific test for the smaller (larger) class has the highest 
(lowest) power estimates. Under the association alternatives, we observe that when at least one sample size is small, 
base-class-specific test for the smaller class has extremely poor performance, while for the larger class it has higher 
power than the NN-class-specific tests. On the other hand, for larger samples, base-class-specific test for the largest 
(smallest) class has the highest (lowest) power. However base- and NN-class-specific tests answer different questions. 
So it is not appropriate to compare the NNCT-tests, as they provide related but different information about the NN 
structure in the data. 

NNCT-tests summarize the pattern in the data set for small scales; more specifically, they provide information 
on the pattern around the average NN distance between the points. On the other hand, pair correlation function 
g{t) and Ripley's classical K or L-functions and other variants provide information on the pattern at various scales. 
However, the classical L- function is not appropriate for the null pa ttern of RL when locations of the points have spatial 
inhomogeneity. For such cases. Biggie's D-function (|Digglel (|2003l ') p. 131) is more appropriate in testing the bivariate 
spatial clustering at various scales. 

Ripley's classical K or L-functions can be used when the null pattern can be assumed to be CSR independence, 
that is, when the null pattern assumes first-order homogeneity for e ach class. When the null pattern is the RL of points 
from an inhomogeneous Poisson process they are not appropriate (|Kulldorfj (|2006l )): Cuzick-Edward's fc-NN tests are 
designed for testing bivariate spatial interaction and mostly used for spatial clustering of cases in epidemiology; Biggie's 
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Figure 13: Pair correlation functions for all neurons combined and for each group in the pyramidal neuron 
data. Wide dashed lines around 1 (which is the theoretical value) are the upper and lower (pointwise) 95 % 
confidence bounds for the L-functions based on Monte Carlo simulation under the CSR independence pattern. 

-D-function is a modified version of Ripley's if- function jDigglel (|2003l )) and adjusts for any inhomogeneity in the 
loca tions of, e.g., ca s es and controls. Furthermore, there are variants of K{t) that explicitly correct for inhomogeneity 
(see iBaddelev et al.l (|2000l )'). Ripley's K-, Biggie's D-functions and pair correlation functions are designed to analyze 
univariate or bivariate spatial interaction at various scales (i.e., inter-point distances). Our examples illustrate that for 
distances around the average NN distance, NNCT-tests and Ripley's L- and Biggie's D-functions yield similar results. 
The NNCT-tests and the second-order analysis provide very similar information for the two-class case at small scales. 
However, for multi-class case with three or more classes, NNCT-tests provide information on the multivariate spatial 
interaction in one compound summary measure; while the second-order analysis requires performing all bivariate 
spatial interaction analysis. 

The overall NNCT-test and Ripley's L-function provide similar information in the two-class case at small scales. For 
the g-class case with q > 2 classes, overall tests provide information on the (small-scale) while the Ripley's L-function 
requires performing all bivariate spatial interaction analysis. On the other hand, the class-specific tests can serve as 
a mean of post hoc analysis only when the overall test is significant. Furthermore, the class-specific tests are testing 
the spatial interaction of one class with all classes (including itself) as part of the multivariate interaction between all 
the classes. On the other hand, Ripley's univariate K- or L-functions are restricted to one class and bivariate K- or 
L-functions are restricted to two classes they pertain to, ignoring the potentially important multivariate interaction 
between all classes in the study area. However, t here a re forms of the J-function which is derived from the well-known 
G and F functions (|van Lieshout and BaddelevI (|l999f )') a nd deal with this multi-type setting (i.e., consider the pattern 
of type i in the context of the pattern of all other types). Ivan Lieshout and BaddelevI (| 19991 ') define two basic types of 
J-functions. First is a type-i-to-type-j function which considers the points of type i in the context of the points of type 
j. The second one is the type- i-to-any- type function which considers the points of type i in the context of points of 
all types including type i. Other forms can be derived from them by re-defining the types. For example, if we want to 
consider the points of type i in the context of points of all other types, then we collapse all the other types j (i.e., all j 
which are not equal to i) into a single type i' and then use the type-^- to-type-^' function. Several a u thors have w ritten 
about t he biv ariate if- function, which is of the type-i-to-type-j form jBiggle and Chetwvndl (|l99ll ). lHaasel (|l995l ). and 
iBiggld (|2003[ )i. Type-i-to-type-j LT-function can easily be modified to type-i-to-any-type if- function. Thus, essentially 
there is only one family of multi-type if-functions in literature. But type-i-to-type-j if-function is comparable with 
a NNCT analysis based on a 2 x 2 NNCT restricted to the classes i and j. Similarly, type-i-to-type-i' if- function is 
comparable with the NNCT analysis based on a 2 x 2 NNCT with classes i and the rest of the classes labeled as i' . 
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Figure 14: Pair correlation lunctions for each pair ol groups in the pyramidal neuron data. Wide dashed 
lines around 1 (which is the theoretical value) are the upper and lower (pointwise) 95 % confidence bounds 
for the L-functions based on Monte Carlo simulations under the CSR independence pattern. Ctrl = Controls, 
S.A. = Schizoaffectives, Sch = Schizophrenics. 

Furthermore, type-i-to-type-i' Tv-function may yield similar information with the base-class-specific test for class i at 
small scales, but the A'-function ignores the interaction of class i with itself. On the other hand, by definition none 
of the variants of the A-function is comparable with the NN-class-specific tests, as they provide information on very 
different aspects of the spatial interaction between classes. 

Since pairw ise analysis of q classes with 2x2 NNCTs might yield conflicting results compared to g x g NNCT analysis 
(|Dixonl (|2002al ')'). Ripley's //-function and NNCT-tests might also yield conflicting results at small distances. Hence 
Ripley's L-function and NNCT-tests may provide similar but not identical information about the spatial pattern and 
the latter might provide small-scale interaction that is not detected by the former. Since the pair correlation functions 
are derivatives of Ripley's A'-function, most of the above discussion holds for them also, except g{t) is reliable only 
for large scale interaction analysis. Hence NNCT-tests and pair correlation function are not comparable but provide 
complimentary information about the pattern in question. 

For a data set for which CSR independence is the reasonable null pattern, we recommend the overall segregation 
test if the question of interest is the spatial interaction at small scales (i.e., about the mean NN distance). If it yields 
a significant result, then to determine which classes have significant spatial interaction with themselves and all other 
classes, the class-specific tests can be performed. One can also perform Ripley's A or L-function and only consider 
distances up to around the average NN distance and compare the results with those of NNCT analysis. I f the spatial 
interaction at higher scales is of interest, pair correlation function is recommended (|Loosmore and FordI (|2006r ii. due 
to the cumulative nature of Ripley's A- or //-functions for larger distances. On the other hand, if the RL pattern is 
the reasonable null pattern for the data, we recommend the NNCT-tests if the small-scale interaction is of interest 
and Biggie's D-function if the spatial interaction at higher scales is also of interest. 
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